Systems and methods for smart dialogue communication

ABSTRACT

Systems and methods for smart dialogue communication are provided. A method may include receiving, from a responder terminal device, a dialogue request configured to request a smart dialogue communication, wherein the dialogue request is associated with an incoming call request that is initiated by a requester via a requester terminal device and satisfies a smart dialogue condition determined by the responder terminal device; performing the smart dialogue communication with the requester terminal device associated with the requester; recording voice information associated with the smart dialogue communication; converting the voice information into the text information; and transmitting the text information to the responder terminal device.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application a continuation of U.S. application Ser. No. 17/238,161filed on Apr. 22, 2021, which is a continuation of U.S. application Ser.No. 16/889,823 (issued as U.S. patent Ser. No. 11/019,207) filed on Jun.2, 2020, which claims priority to Chinese Application No.201911082387.9, filed on Nov. 7, 2019, Chinese Application No.201911082383.0, filed on Nov. 7, 2019, Chinese Application No.201911082413.8, filed on Nov. 7, 2019, Chinese Application No.201911083330.0, filed on Nov. 7, 2019, and Chinese Application No.201911082348.9, filed on Nov. 7, 2019, the entire contents of each ofwhich are hereby incorporated by reference.

TECHNICAL FIELD

The present disclosure generally relates to the field of communicationtechnology, and in particular, to systems and methods for smart dialoguecommunication.

BACKGROUND

With the development of communication technology, users may need moreprofessional, personal, and intelligent communication services.Generally, a user may first preset a transferring number, and anincoming call can be transferred to the transferring number when theuser is not convenient to answer the incoming call. However, in somecases, some incoming calls may be missed, for example, when thetransferring number is in a no signal environment. Besides, due toinformation leakage, a user may receive a great amount of harassingcalls, which causes inconvenience to the user. Thus, it is desirable toprovide systems and/or methods for smart dialogue communication.

SUMMARY

An aspect of the present disclosure introduces a system for smartdialogue communication is provided. The system includes at least onestorage device including a set of instructions and at least oneprocessor in communication with the at least one storage device. Whenexecuting the set of instructions, the at least one processor isconfigured to cause the system to perform operations including:receiving, from a responder terminal device, a dialogue requestconfigured to request a smart dialogue communication, wherein thedialogue request is associated with an incoming call request that isinitiated by a requester via a requester terminal device and satisfies asmart dialogue condition determined by the responder terminal device;performing the smart dialogue communication with the requester terminaldevice associated with the requester; recording voice informationassociated with the smart dialogue communication; converting the voiceinformation into the text information; and transmitting the textinformation to the responder terminal device.

In some embodiments, the smart dialogue condition is associated with atleast one of: whether an incoming call number associated with theincoming call request exists in a contact list of the responder terminaldevice; whether the incoming call number exists in a blacklist of theresponder terminal device; or whether a request time length of theincoming call request exceeds a time threshold.

In some embodiments, the at least one processor is further configured tocause the system to perform operations including: obtaining, from therequester terminal device, one or more first incoming voice messagesassociated with the smart dialogue communication; determining one ormore feature words based on the one or more first incoming voicemessages; and determining identity information by matching the one ormore feature words in an identity database.

In some embodiments, performing the smart dialogue communicationincludes: obtaining, from the requester terminal device, one or moresecond incoming voice messages; converting the one or more secondincoming voice messages into one or more incoming text messages;determining text features by performing feature extraction on at leastone of the one or more incoming text messages; determining questioninformation associated with the text features by matching the textfeatures in a question knowledge database; obtaining answer informationcorresponding to the question information by matching the questioninformation in an answer knowledge database; converting the answerinformation into at least one reply voice message; and transmitting theat least one reply voice message to the requester terminal device.

In some embodiments, the at least one processor is further configured tocause the system to perform operations including: determining sentimentfeatures by performing sentiment analysis on the text information;determining mood features by performing feature extraction on the textinformation; determining integrated features by integrating thesentiment features and the mood features; determining an emotion type bymatching the integrated features in an emotion database; andtransmitting the emotion type to the responder terminal deviceassociated with the incoming call request.

In some embodiments, the at least one processor is further configured tocause the system to perform operations including: generating suggestioninformation based on the emotion type, the suggestion information beingassociated with whether to respond to the incoming call request; andtransmitting the suggestion information to the responder terminaldevice.

In some embodiments, the at least one processor is further configured tocause the system to perform operations including: determining whetherprompt information is received, the prompt information being associatedwith a reply instruction provided by the responder terminal device; andin response to determining that the prompt information is received,converting the prompt information into a reply voice message; andtransmitting the reply voice message to the requester terminal device.

In some embodiments, before performing the smart dialogue communication,the at least one processor is further configured to cause the system toperform operations including: determining whether an answer instructioncorresponding to the incoming call request is received from theresponder terminal device; and in response to determining that theanswer instruction is not received, performing the smart dialoguecommunication.

In some embodiments, the at least one processor is further configured tocause the system to perform operations including: determining eventkeywords by performing semantic analysis on the text information;generating schedule information based on the event keywords; andtransmitting the schedule information to the responder terminal deviceassociated with the incoming call request for display.

In some embodiments, the determining the event keywords by performingsemantic analysis on the text information includes: processing the textinformation by performing word segmentation and annotation on the textinformation; determining text features by performing feature extractionon the processed text information using a language model; anddetermining the event keywords by performing semantic analysis on thetext features using a semantic analysis model.

In some embodiments, the determining the event keywords includes:obtaining time information by performing the semantic analysis on thetext information; determining whether the time information includes anabsolute time; and in response to determining that the time informationincludes the absolute time, designating the time information as an eventtime; or in response to determining that the time information includesno absolute time, determining the event time based on the timeinformation and a system time.

In some embodiments, the at least one processor is further configured tocause the system to perform operations including: determining anemergency level of the schedule information according to a levelclassification rule; determining a reminder time of the scheduleinformation based on the emergency level; and causing the responderterminal device to generate a reminder at the reminder time.

According to another aspect of the present disclosure, a method forsmart dialogue communication is provided. The method includes receiving,from a responder terminal device, a dialogue request configured torequest a smart dialogue communication, wherein the dialogue request isassociated with an incoming call request that is initiated by arequester via a requester terminal device and satisfies a smart dialoguecondition determined by the responder terminal device; performing thesmart dialogue communication with the requester terminal deviceassociated with the requester; recording voice information associatedwith the smart dialogue communication; converting the voice informationinto the text information; and transmitting the text information to theresponder terminal device.

According to still another aspect of the present disclosure, anon-transitory computer-readable medium, comprising at least one set ofinstructions compatible for smart dialogue communication, is provided.When executed by at least one processor of an electrical device, the atleast one set of instructions directs the at least one processor toperform a method including receiving, from a responder terminal device,a dialogue request configured to request a smart dialogue communication,wherein the dialogue request is associated with an incoming call requestthat is initiated by a requester via a requester terminal device andsatisfies a smart dialogue condition determined by the responderterminal device; performing the smart dialogue communication with therequester terminal device associated with the requester; recording voiceinformation associated with the smart dialogue communication; convertingthe voice information into the text information; and transmitting thetext information to the responder terminal device.

According to still another aspect of the present disclosure, a systemfor smart dialogue communication is provided. The system includes atleast one storage device including a set of instructions; and at leastone processor in communication with the at least one storage device.When executing the set of instructions, the at least one processor isconfigured to cause the system to perform operations including:performing, based on a dialogue request, a smart dialogue communicationwith a requester terminal device associated with a requester, thedialogue request being associated with an incoming call requestinitiated by the requester via the requester terminal device; obtainingreal-time dialogue information associated with the smart dialoguecommunication; transmitting the real-time dialogue information to aresponder terminal device; determining whether prompt information isreceived, the prompt information being associated with a replyinstruction provided, based on the real-time dialogue information, bythe responder terminal device; and in response to determining that theprompt information is received, converting the prompt information into areply voice message; and transmitting the reply voice message to therequester terminal device.

According to still another aspect of the present disclosure, a systemfor smart dialogue communication is provided. The system includes atleast one storage device including a set of instructions; and at leastone processor in communication with the at least one storage device.When executing the set of instructions, the at least one processor isconfigured to cause the system to perform operations including:performing, based on a dialogue request, a smart dialogue communicationwith a requester terminal device associated with a requester, thedialogue request being associated with an incoming call requestinitiated by the requester via the requester terminal device; obtainingincoming voice messages associated with the smart dialoguecommunication; converting the incoming voice messages into incoming textmessages; determining event keywords by performing semantic analysis onthe incoming text messages; generating schedule information based on theevent keywords; and transmitting the schedule information to a responderterminal device associated with the incoming call request for display.

According to still another aspect of the present disclosure, a systemfor smart dialogue communication is provided. The system includes atleast one storage device including a set of instructions; and at leastone processor in communication with the at least one storage device.When executing the set of instructions, the at least one processor isconfigured to cause the system to perform operations including:performing, based on a dialogue request, a smart dialogue communicationwith a requester terminal device associated with a requester, thedialogue request being associated with an incoming call requestinitiated by the requester via the requester terminal device; obtainingincoming voice messages associated with the smart dialoguecommunication; converting the incoming voice messages into incoming textmessages; determining sentiment features by performing sentimentanalysis on the incoming text messages; determining mood features byperforming feature extraction on the incoming voice messages;determining integrated features by integrating the sentiment featuresand the mood features; determining an emotion type by matching theintegrated features in an emotion database; and transmitting the emotiontype to a responder terminal device associated with the incoming callrequest.

According to still another aspect of the present disclosure, a systemfor smart dialogue communication is provided. The system includes atleast one storage device including a set of instructions; and at leastone processor in communication with the at least one storage device.When executing the set of instructions, the at least one processor isconfigured to cause the system to perform operations including:receiving an incoming call request initiated by a requester via arequester terminal device; determining whether the incoming call requestsatisfies a smart dialogue condition; and in response to determiningthat the incoming call request satisfies the smart dialogue condition,generating, based on the incoming call request, a dialogue requestconfigured to request a smart dialogue communication; causing a smartdialogue communication platform to perform the smart dialoguecommunication with the requester terminal device associated with therequester by transmitting the dialogue request to the smart dialoguecommunication platform; receiving, from the smart dialogue communicationplatform, dialogue information associated with the smart dialoguecommunication; and outputting the dialogue information; wherein thedialogue information includes text information, and the text informationis generated by the smart dialogue communication platform according to afirst process including: recording voice information associated with thesmart dialogue communication; and converting the voice information intothe text information.

According to still another aspect of the present disclosure, a systemfor smart dialogue communication is provided. The system includes atleast one storage device including a set of instructions; and at leastone processor in communication with the at least one storage device.When executing the set of instructions, the at least one processor isconfigured to cause the system to perform operations including:receiving an incoming call request initiated by a requester via arequester terminal device; determining whether the incoming call requestsatisfies a smart dialogue condition; in response to determining thatthe incoming call request satisfies the smart dialogue condition,generating, based on the incoming call request, a dialogue requestconfigured to request a smart dialogue communication; causing a smartdialogue communication platform to perform the smart dialoguecommunication with the requester terminal device associated with therequester by transmitting the dialogue request to the smart dialoguecommunication platform; and receiving, from the smart dialoguecommunication platform, identity information of the requester; andoutputting the identity information; wherein the identity information isdetermined by the smart dialogue communication platform according to afirst process including: obtaining, from the requester terminal device,one or more incoming voice messages associated with the smart dialoguecommunication; determining one or more feature words based on the one ormore incoming voice messages; and determining the identity informationby matching the one or more feature words in an identity database.

Additional features will be set forth in part in the description whichfollows, and in part will become apparent to those skilled in the artupon examination of the following and the accompanying drawings or maybe learned by production or operation of the examples. The features ofthe present disclosure may be realized and attained by practice or useof various aspects of the methodologies, instrumentalities andcombinations set forth in the detailed examples discussed below.

BRIEF DESCRIPTION OF THE DRAWINGS

The present disclosure is further described in terms of exemplaryembodiments. These exemplary embodiments are described in detail withreference to the drawings. The drawings are not to scale. Theseembodiments are non-limiting schematic embodiments, in which likereference numerals represent similar structures throughout the severalviews of the drawings, and wherein:

FIG. 1 is a schematic diagram illustrating an exemplary smart dialoguecommunication system according to some embodiments of the presentdisclosure;

FIG. 2 is a schematic diagram illustrating exemplary hardware and/orsoftware components of a computing device according to some embodimentsof the present disclosure;

FIG. 3 is a schematic diagram illustrating exemplary hardware and/orsoftware components of a mobile device according to some embodiments ofthe present disclosure;

FIG. 4 is a block diagram illustrating an exemplary processing deviceaccording to some embodiments of the present disclosure;

FIG. 5 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 6 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 7 is a schematic diagram illustrating an exemplary system for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 8 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 9 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 10 is a schematic diagram illustrating an exemplary system forsmart dialogue communication according to some embodiments of thepresent disclosure;

FIG. 11 is a block diagram illustrating an exemplary device for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 12 is a flowchart illustrating an exemplary process for generatingschedule information according to some embodiments of the presentdisclosure;

FIG. 13 is a schematic diagram illustrating an exemplary system forgenerating schedule information according to some embodiments of thepresent disclosure;

FIG. 14 is a block diagram illustrating an exemplary device for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 15 is a flowchart illustrating an exemplary process for determiningan emotion type of a requester according to some embodiments of thepresent disclosure;

FIG. 16 is a schematic diagram illustrating an exemplary system fordetermining an emotion type of a requester according to some embodimentsof the present disclosure;

FIG. 17 is a block diagram illustrating an exemplary device fordetermining an emotion type of a requester according to some embodimentsof the present disclosure;

FIG. 18A is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 18B is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 19 is a schematic diagram illustrating an exemplary system forrecording text information associated with a smart dialoguecommunication according to some embodiments of the present disclosure;

FIG. 20A is a schematic diagram illustrating an exemplary device forrecording text information associated with a smart dialoguecommunication according to some embodiments of the present disclosure;

FIG. 20B is a schematic diagram illustrating an exemplary device forrecording text information associated with a smart dialoguecommunication according to some embodiments of the present disclosure;

FIG. 21 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure;

FIG. 22 is a schematic diagram illustrating an exemplary system foridentifying identity information of a requester according to someembodiments of the present disclosure;

FIG. 23 is a schematic diagram illustrating an exemplary device foridentifying identity information of a requester according to someembodiments of the present disclosure; and

FIG. 24 is a schematic diagram illustrating an exemplary deviceaccording to some embodiment of the present disclosure.

DETAILED DESCRIPTION

The following description is presented to enable any person skilled inthe art to make and use the present disclosure, and is provided in thecontext of a particular application and its requirements. Variousmodifications to the disclosed embodiments will be readily apparent tothose skilled in the art, and the general principles defined herein maybe applied to other embodiments and applications without departing fromthe spirit and scope of the present disclosure. Thus, the presentdisclosure is not limited to the embodiments shown, but is to beaccorded the widest scope consistent with the claims.

The terminology used herein is for the purpose of describing particularexample embodiments only and is not intended to be limiting. As usedherein, the singular forms “a,” “an,” and “the” may be intended toinclude the plural forms as well, unless the context clearly indicatesotherwise. It will be further understood that the terms “comprise,”“comprises,” and/or “comprising,” “include,” “includes,” and/or“including,” when used in this disclosure, specify the presence ofstated features, integers, steps, operations, elements, and/orcomponents, but do not preclude the presence or addition of one or moreother features, integers, steps, operations, elements, components,and/or groups thereof.

These and other features, and characteristics of the present disclosure,as well as the methods of operations and functions of the relatedelements of structure and the combination of parts and economies ofmanufacture, may become more apparent upon consideration of thefollowing description with reference to the accompanying drawings, allof which form a part of this disclosure. It is to be expresslyunderstood, however, that the drawings are for the purpose ofillustration and description only and are not intended to limit thescope of the present disclosure. It is understood that the drawings arenot to scale.

The flowcharts used in the present disclosure illustrate operations thatsystems implement according to some embodiments of the presentdisclosure. It is to be expressly understood, the operations of theflowcharts may be implemented not in order. Conversely, the operationsmay be implemented in inverted order, or simultaneously. Moreover, oneor more other operations may be added to the flowcharts. One or moreoperations may be removed from the flowcharts.

The various embodiments of the present disclosure may be applied todifferent fields, including without limitation to financial andderivatives investment (including without limitation to stocks, bonds,gold, paper gold, silver, foreign currency, precious metals, futures,money funds, etc.), technology (including without limitation tomathematics, physics, chemistry and chemical engineering, biology andbiological engineering, electrical engineering, communication systems,internet, internet of things, etc.), politics (including withoutlimitation to politicians, political events, countries, etc.), news (interms of regions, including without limitation to local news, nationalnews, international news; in terms of the subjects of news, includingwithout limitation to political news, technology news, economic news,social news, meteorological news, etc.). The above description of thefields of applications is merely for example, and should not beunderstood as the only embodiments. Obviously, those skilled in the art,after understanding the basic principles of the method and system forsmart dialogue communication, may modify the form and details of themethod and system for various application fields without departing fromthe principles. Such modifications and variations are still within thescope of the present disclosure described herein.

The present disclosure relates to systems and methods for smart dialoguecommunication. The systems and methods may receive a dialogue requestconfigured to request a smart dialogue communication from a responderterminal device. The dialogue request may be associated with an incomingcall request that is initiated by a requester via a requester terminaldevice and satisfies a smart dialogue condition determined by theresponder terminal device. The smart dialogue condition may beassociated with whether an incoming call number associated with theincoming call request exists in a contact list of the responder terminaldevice; whether the incoming call number exists in a blacklist of theresponder terminal device; and/or whether a request time length of theincoming call request exceeds a time threshold. The systems and methodsmay perform the smart dialogue communication with the requester terminaldevice associated with the requester, and/or generate dialogueinformation associated with the smart dialogue communication. Thedialogue information may include voice information, text information,etc. In some embodiments, the systems and methods may transmit thedialogue information to the responder terminal device in real-time orintermittently. In some embodiments, information associated with therequester and/or a responder (e.g., a user of the responder terminaldevice) may be determined based on the dialogue information. Exemplaryinformation may include schedule information, emotion type of therequester, identity information of the requester, or the like. In someembodiments, the systems and methods may provide an interface for theuser of the responder terminal device to intervene in the smart dialoguecommunication.

In one aspect of the present disclosure, a smart dialogue communicationplatform may obtain real-time dialogue information associated with thesmart dialogue communication, and transmit the real-time dialogueinformation to the responder terminal device, so that the user of theresponder terminal device can know or obtain the dialogue information inreal-time. Besides, the user can intervene in the smart dialoguecommunication by sending prompt information to the smart dialoguecommunication platform, thereby improving the efficiency of the smartdialogue communication, and improving user experiences.

In another aspect of the present disclosure, when the user is notconvenient to respond to the incoming call request, the smart dialoguecommunication platform may perform the smart dialogue communication withthe requester terminal device to obtain the incoming voice messages. Thesmart dialogue communication platform may convert the incoming voicemessages into the incoming text messages, and determine the eventkeywords by performing semantic analysis on the incoming text messages.The smart dialogue communication platform may then generate scheduleinformation based on the event keywords and/or transmit the scheduleinformation to the responder terminal device associated with theincoming call request for display. Therefore, information loss and themissing of important events may be avoided, thereby improving userexperiences. In some embodiments, the smart dialogue communicationplatform may store the schedule information instead of the voiceinformation (or text information) generated during the smart dialoguecommunication in a storage device associated with the smart dialoguecommunication platform, or the responder terminal device may store theschedule information instead of the voice information (or textinformation), thereby saving the storage space of the storage device(e.g., the storage device 150, the ROM 230, the RAM 240, the disk 270,or the storage 390, or an external storage device associated with thesmart dialogue communication system 100). By generating the scheduleinformation, the smart dialogue communication platform or the responderterminal device may obtain or identify relatively more effectiveinformation during the smart dialogue communication, thereby avoidinginformation loss (e.g., avoiding missing important events), providinggreat convenience for incoming call affair processing, and improving theuser experiences.

In yet another aspect of the present disclosure, the smart dialoguecommunication platform may perform the smart dialogue communication withthe requester terminal device to obtain the incoming voice messages. Thesmart dialogue communication platform may convert the incoming voicemessages into the incoming text messages. The smart dialoguecommunication platform may determine sentiment features by performingsentiment analysis on the incoming text messages, and determine moodfeatures by performing feature extraction on the incoming voicemessages. The smart dialogue communication platform may determine theintegrated features by integrating the sentiment features and the moodfeatures. The smart dialogue communication platform may determine anemotion type by matching the integrated features in an emotion databaseand transmit the emotion type to the responder terminal device. Thus,the user of the responder terminal device can know the emotion of therequester, and can respond to the incoming call request based on theemotion of the requester. Therefore, user experiences can be improved.In the present disclosure, the smart dialogue communication platform orthe responder terminal device may obtain or identify relatively moreeffective information during the smart dialogue communication (such asdetermining the emotion type of the requester), thereby making itconvenient for the user of the responder terminal device to deal withincoming call affairs based on the obtained information (such as theemotion type of the requester), and improving the user experiences.

In yet another aspect of the present disclosure, the smart dialoguecommunication may be performed when the user is not convenient to answerthe incoming call request (e.g., the user is a deaf mute). In thepresent disclosure, when an incoming call request is received, theresponder terminal device may first determine whether the smart dialoguecondition is satisfied. If the smart dialogue condition is satisfied,the smart dialogue communication may be performed, thereby saving thetime of the user and improving the response efficiency. Besides, thevoice information generated during the smart dialogue communication maybe recorded, thereby effectively avoiding information loss (e.g.,missing important incoming call request) and greatly improving the userexperiences.

In yet another aspect of the present disclosure, the smart dialoguecommunication may be performed when the user is not convenient to answerthe incoming call request. In the present disclosure, when an incomingcall request is received, the responder terminal device may firstdetermine whether the smart dialogue condition is satisfied. If thesmart dialogue condition is satisfied, the smart dialogue communicationmay be performed, thereby saving the time of the user and improving theresponse efficiency. Besides, the identity information of the requesteris identified, and the user can decide whether to intervene in the smartdialogue communication, thereby reducing or filtrating harassing callswhile not missing important calls, and improving user experiences.

FIG. 1 is a schematic diagram illustrating an exemplary smart dialoguecommunication system according to some embodiments of the presentdisclosure. For example, the smart dialogue communication system 100 mayinclude a server 110, a network 120, a requester terminal device 130, aresponder terminal device 140, and a storage device 150.

In some embodiments, the server 110 may be a single server or a servergroup. The server group may be centralized, or distributed (e.g., server110 may be a distributed system). In some embodiments, the server 110may be local or remote. For example, the server 110 may accessinformation and/or data stored in the requester terminal device 130, theresponder terminal device 140, and/or the storage device 150 via thenetwork 120. As another example, the server 110 may be directlyconnected to the requester terminal device 130, the responder terminaldevice 140, and/or the storage device 150 to access stored informationand/or data. In some embodiments, the server 110 may be implemented on acloud platform. Merely by way of example, the cloud platform may includea private cloud, a public cloud, a hybrid cloud, a community cloud, adistributed cloud, an inter-cloud, a multi-cloud, or the like, or anycombination thereof. In some embodiments, the server 110 may beimplemented on a computing device 200 having one or more componentsillustrated in FIG. 2 in the present disclosure.

In some embodiments, the server 110 may support or provide a function ofa smart dialogue communication platform (e.g., a smart dialoguecommunication robot) configured to perform a smart dialoguecommunication with one or more requester terminal devices (e.g., therequester terminal device 130). In some embodiments, the server 110 mayinclude a processing device 112. The processing device 112 may processinformation and/or data relating to the smart dialogue communicationand/or perform one or more functions described in the presentdisclosure. For example, the processing device 112 may generate dialogueinformation associated with the smart dialogue communication. In someembodiments, the processing device 112 may include one or moreprocessing devices (e.g., single-core processing device(s) or multi-coreprocessor(s)). Merely by way of example, the processing device 112 mayinclude one or more hardware processors, such as a central processingunit (CPU), an application-specific integrated circuit (ASIC), anapplication-specific instruction-set processor (ASIP), a graphicsprocessing unit (GPU), a physics processing unit (PPU), a digital signalprocessor (DSP), a field-programmable gate array (FPGA), a programmablelogic device (PLD), a controller, a microcontroller unit, a reducedinstruction-set computer (RISC), a microprocessor, or the like, or anycombination thereof.

The network 120 may facilitate the exchange of information and/or data.In some embodiments, one or more components in the smart dialoguecommunication system 100 (e.g., the server 110, the requester terminaldevice 130, the responder terminal device 140, and/or the storage device150) may send information and/or data to other component(s) in the smartdialogue communication system 100 via the network 120. For example, theserver 110 may obtain/acquire service request (e.g., a dialogue request)from the requester terminal device 130 or the responder terminal device140 via the network 120. As another example, the server 110 may transmitdialogue information to the responder terminal device 140 via thenetwork 120. In some embodiments, the network 120 may be any type ofwired or wireless network, or a combination thereof. Merely by way ofexample, the network 120 may include a cable network, a wirelinenetwork, an optical fiber network, a telecommunications network, anintranet, the Internet, a local area network (LAN), a wide area network(WAN), a wireless local area network (WLAN), a metropolitan area network(MAN), a wide area network (WAN), a public telephone switched network(PSTN), a Bluetooth™ network, a ZigBee™ network, a near fieldcommunication (NFC) network, or the like, or any combination thereof. Insome embodiments, the network 120 may include one or more network accesspoints. For example, the network 120 may include wired or wirelessnetwork access points such as base stations and/or internet exchangepoints 120-1, 120-2, . . . , through which one or more components of thesmart dialogue communication system 100 may be connected to the network120 to exchange data and/or information.

In some embodiments, the requester terminal device 130 may include amobile device 130-1, a tablet computer 130-2, a laptop computer 130-3, abuilt-in device in a motor vehicle 130-4, or the like, or anycombination thereof. In some embodiments, the mobile device 130-1 mayinclude a smart home device, a wearable device, a mobile device, avirtual reality device, an augmented reality device, or the like, or anycombination thereof. In some embodiments, the smart home device mayinclude a smart lighting device, a control device of an intelligentelectrical apparatus, a smart monitoring device, a smart television, asmart video camera, an interphone, or the like, or any combinationthereof. In some embodiments, the wearable device may include abracelet, footgear, glasses, a helmet, a watch, clothing, a backpack, asmart accessory, or the like, or any combination thereof. In someembodiments, the mobile device may include a mobile phone, a personaldigital assistance (PDA), a gaming device, a navigation device, a pointof sale (POS) device, a laptop, a desktop, or the like, or anycombination thereof. In some embodiments, the virtual reality deviceand/or the augmented reality device may include a virtual realityhelmet, a virtual reality glass, a virtual reality patch, an augmentedreality helmet, augmented reality glasses, an augmented reality patch,or the like, or any combination thereof. For example, the virtualreality device and/or the augmented reality device may include a GoogleGlass™, a RiftCon™, a Fragments™, a Gear VR™, etc. In some embodiments,a built-in device in the motor vehicle 130-4 may include an onboardcomputer, an onboard television, etc. In some embodiments, the responderterminal device 140 may be a device that is similar to, or the same asthe requester terminal device 130. For example, the requester terminaldevice 130 and/or the responder terminal device 140 may include a smartmobile phone. In some embodiments, the requester terminal device 130 maybe configured to initiate an incoming call request. The responderterminal device 140 may be configured to determine whether the incomingcall request satisfies a smart dialogue condition. In some embodiments,if the incoming call request does not satisfy the smart dialoguecondition, the responder terminal device 140 may respond to the incomingcall request. In some embodiments, if the incoming call requestsatisfies the smart dialogue condition, the responder terminal device140 may generate a dialogue request and/or transmit the dialogue requestto the server 110 to request a smart dialogue communication.

The storage device 150 may store data and/or instructions. In someembodiments, the storage device 150 may store data obtained from therequester terminal device 130 and/or the responder terminal device 140.In some embodiments, the storage device 150 may store data and/orinstructions that the server 110 may execute or use to perform exemplarymethods described in the present disclosure. In some embodiments,storage device 150 may include a mass storage device, a removablestorage device, a volatile read-and-write memory, a read-only memory(ROM), or the like, or any combination thereof. Exemplary mass storagedevices may include a magnetic disk, an optical disk, a solid-statedrive, etc. Exemplary removable storage devices may include a flashdrive, a floppy disk, an optical disk, a memory card, a zip disk, amagnetic tape, etc. Exemplary volatile read-and-write memory may includea random access memory (RAM). Exemplary RAM may include a dynamic RAM(DRAM), a double date rate synchronous dynamic RAM (DDR SDRAM), a staticRAM (SRAM), a thyrisor RAM (T-RAM), and a zero-capacitor RAM (Z-RAM),etc. Exemplary ROM may include a mask ROM (MROM), a programmable ROM(PROM), an erasable programmable ROM (EPROM), an electrically-erasableprogrammable ROM (EEPROM), a compact disk ROM (CD-ROM), and a digitalversatile disk ROM, etc. In some embodiments, the storage device 150 maybe implemented on a cloud platform. Merely by way of example, the cloudplatform may include a private cloud, a public cloud, a hybrid cloud, acommunity cloud, a distributed cloud, an inter-cloud, a multi-cloud, orthe like, or any combination thereof.

In some embodiments, the storage device 150 may be connected to thenetwork 120 to communicate with one or more components of the smartdialogue communication system 100 (e.g., the server 110, the requesterterminal device 130, the responder terminal device 140). One or morecomponents in the smart dialogue communication system 100 may access thedata or instructions stored in the storage device 150 via the network120. In some embodiments, the storage device 150 may be directlyconnected to or communicate with one or more components in the smartdialogue communication system 100 (e.g., the server 110, the requesterterminal device 130, the responder terminal device 140). In someembodiments, the storage device 150 may be part of the server 110.

In some embodiments, one or more components of the smart dialoguecommunication system 100 (e.g., the server 110, the requester terminaldevice 130, the responder terminal device 140) may have permission toaccess the storage device 150. In some embodiments, one or morecomponents of the smart dialogue communication system 100 may readand/or modify information relating to the requester, provider, and/orthe public when one or more conditions are met. For example, the server110 may read and/or modify one or more users' information after aservice is completed. As another example, the responder terminal device140 may access information relating to the requester when receiving aservice request from the requester terminal device 130, but theresponder terminal device 140 may not modify the relevant information ofthe requester.

One of ordinary skill in the art would understand that when an element(or component) of the smart dialogue communication system 100 performs,the element may perform through electrical signals and/orelectromagnetic signals. For example, when a requester terminal device130 or a responder terminal device 140 transmits out a service request(e.g., a dialogue request) to the server 110, a processor of therequester terminal device 130 or the responder terminal device 140 maygenerate an electrical signal encoding the request. The processor of therequester terminal device 130 or the responder terminal device 140 maythen transmit the electrical signal to an output port. If the requesterterminal device 130 or the responder terminal device 140 communicateswith the server 110 via a wired network, the output port may bephysically connected to a cable, which further may transmit theelectrical signal to an input port of the server 110. If the requesterterminal device 130 or the responder terminal device 140 communicateswith the server 110 via a wireless network, the output port of therequester terminal device 130 may be one or more antennas, which convertthe electrical signal to an electromagnetic signal. Similarly, therequester terminal device 130 or the responder terminal device 140 mayreceive an instruction and/or dialogue information from the server 110via electrical signals or electromagnet signals. Within an electronicdevice, such as the requester terminal device 130, the responderterminal device 140, and/or the server 110, when a processor thereofprocesses an instruction, transmits out an instruction, and/or performsan action, the instruction and/or action may be conducted via electricalsignals. For example, when the processor retrieves or saves data from astorage medium, it may transmit out electrical signals to a read/writedevice of the storage medium, which may read or write structured data inthe storage medium. The structured data may be transmitted to theprocessor in the form of electrical signals via a bus of the electronicdevice. Here, an electrical signal may refer to one electrical signal, aseries of electrical signals, and/or a plurality of discrete electricalsignals.

FIG. 2 is a schematic diagram illustrating exemplary hardware andsoftware components of a computing device according to some embodimentsof the present disclosure. In some embodiments, the server 110, therequester terminal device 130, and/or the responder terminal device 140may be implemented on the computing device 200. For example, theprocessing device 112 of the server 110 may be implemented on thecomputing device 200 and configured to perform functions of theprocessing device 112 disclosed in this disclosure.

The computing device 200 may be a general purpose computer or a specialpurpose computer, both may be used to implement a smart dialoguecommunication system (e.g., the smart dialogue communication system 100)for the present disclosure. The computing device 200 may be used toimplement any component of the smart dialogue communication system asdescribed herein. For example, the processing device 112 may beimplemented on the computing device 200, via its hardware, softwareprogram, firmware, or a combination thereof. Although only one suchcomputer is shown, for convenience, the computer functions relating tothe smart dialogue communication as described herein may be implementedin a distributed fashion on a number of similar platforms, to distributethe processing load.

The computing device 200, for example, may include a COM port 250connected to and/or from a network connected thereto to facilitate datacommunications. The computing device 200 may also include a processor220, in the form of one or more processors (or CPUs), for executingprogram instructions. The exemplary computing device may include aninternal communication bus 210, different types of program storage unitsand/or data storage units (e.g., a disk 270, a read only memory (ROM)230, a random access memory (RAM) 240), various data files applicable tocomputer processing and/or communication. The exemplary computing devicemay also include program instructions stored in the ROM 230, RAM 240,and/or any other type of non-transitory storage medium to be executed bythe processor 220. The method and/or process of the present disclosuremay be implemented as the program instructions. The computer device 200may also include an I/O device 260 that may support the input and/oroutput of data flows between the computing device 200 and othercomponents. The computing device 200 may also receive programs and/ordata via the communication network.

Merely for illustration, only one CPU and/or processor is described inthe computing device 200. However, it should be noted that the computingdevice 200 in the present disclosure may also include multiple CPUsand/or processors, thus operations and/or method operations that areperformed by one CPU and/or processor as described in the presentdisclosure may also be jointly or separately performed by the multipleCPUs and/or processors. For example, if in the present disclosure theCPU and/or processor of the computing device 200 executes both operationA and operation B, it should be understood that operation A andoperation B may also be performed by two different CPUs and/orprocessors jointly or separately in the computing device 200 (e.g., thefirst processor executes operation A and the second processor executesoperation B, or the first and second processors jointly executeoperations A and B).

FIG. 3 is a schematic diagram illustrating exemplary hardware and/orsoftware components of a mobile device according to some embodiments ofthe present disclosure. In some embodiments, the mobile device 300 maybe an exemplary embodiment corresponding to the requester terminaldevice 130 or the responder terminal device 140. As illustrated in FIG.3, the mobile device 300 may include a communication platform 310, adisplay 320, a graphic processing unit (GPU) 330, a central processingunit (CPU) 340, an I/O 350, a memory 360, an operating system (OS) 370,a storage 390. In some embodiments, any other suitable component,including but not limited to a system bus or a controller (not shown),may also be included in the mobile device 300.

In some embodiments, an operating system (OS) 370 (e.g., iOS™, Android™,Windows Phone™, etc.) and one or more applications (apps) 380 may beloaded into the memory 360 from the storage 390 in order to be executedby the CPU 340. The applications 380 may include a browser or any othersuitable mobile apps for receiving and/or rendering information relatingto smart dialogue communication or other information from the smartdialogue communication system 100. User interactions with theinformation stream may be achieved via the I/O 350 and provided to thestorage device 150, the server 110 and/or other components of the smartdialogue communication system 100.

To implement various modules, units, and their functionalities describedin the present disclosure, computer hardware platforms may be used asthe hardware platform(s) for one or more of the elements describedherein. A computer with user interface elements may be used to implementa personal computer (PC) or any other type of work station or terminaldevice. A computer may also act as a system if appropriately programmed.

FIG. 4 is a block diagram illustrating an exemplary processing deviceaccording to some embodiments of the present disclosure. The processingdevice 112 may be implemented on the computing device (e.g., theprocessor 220 of the computing device 200) as illustrated in FIG. 2,and/or the CPU 340 as illustrated in FIG. 3. In some embodiments, theprocessing device 112 may be implemented on a smart dialoguecommunication platform. The processing device 112 may include areceiving module 402, a smart dialogue communication module 404, adialogue information generation module 406, and a transmission module408.

The receiving module 402 may be configured to receive information and/ordata from a responder terminal device (e.g., the responder terminaldevice 140) via a network (e.g., the network 120). In some embodiments,the receiving module 402 may receive a dialogue request from theresponder terminal device. The dialogue request may be configured torequest a smart dialogue communication and/or be associated with anincoming call request. More descriptions of the dialogue request and/orthe incoming call request may be found elsewhere in the presentdisclosure (e.g., operation 501 of process 500 and the descriptionsthereof).

The smart dialogue communication module 404 may be configured to performa smart dialogue communication with a requester terminal device (e.g.,the requester terminal device 130). In some embodiments, the smartdialogue communication may refer to the communication between human(e.g., a requester of the requester terminal device) and a machine(e.g., a smart dialogue communication robot). In some embodiments, thesmart dialogue communication may be performed between the smart dialoguecommunication platform and the requester terminal device. Moredescriptions regarding the smart dialogue communication may be foundelsewhere in the present disclosure (e.g., FIG. 6 and the descriptionsthereof).

The dialogue information generation module 406 may be configured togenerate dialogue information associated with the smart dialoguecommunication. In some embodiments, the dialogue information may includevoice information, text information, etc. For example, the dialogueinformation may include voice information relating to or generatedduring the smart dialogue communication, such as incoming voice messages(e.g., voice messages from the requester terminal device), reply voicemessages (e.g., voice messages generated by the smart dialoguecommunication platform, or voice messages obtained from the responderterminal device, or the like). As another example, the dialogueinformation may include text information relating to or generated duringthe smart dialogue communication, such as incoming text messages (e.g.,text messages corresponding to the incoming voice messages, or textmessages obtained from the requester terminal device, or the like),reply text messages (e.g., text messages corresponding to the replyvoice messages, or text messages obtained from the responder terminaldevice, or the like).

The transmission module 408 may be configured to transmit the dialogueinformation to the responder terminal device (e.g., the responderterminal device 140). The transmission module 408 may transmit thedialogue information to the responder terminal device in real-time orintermittently. In some embodiments, the transmission module 408 maytransmit the dialogue information to the responder terminal deviceautomatically. Alternatively, the transmission module 408 may transmitthe dialogue information to the responder terminal device when receivingan instruction (e.g., a check instruction) from the responder terminaldevice.

The modules in the processing device 112 may be connected to orcommunicate with each other via a wired connection and/or a wirelessconnection. The wired connection may include a metal cable, an opticalcable, a hybrid cable, or the like, or any combination thereof. Thewireless connection may include a Local Area Network (LAN), a Wide AreaNetwork (WAN), a Bluetooth, a ZigBee, a Near Field Communication (NFC),or the like, or any combination thereof.

It should be noted that the above description is merely provided for thepurposes of illustration, and not intended to limit the scope of thepresent disclosure. For persons having ordinary skills in the art,multiple variations and modifications may be made under the teachings ofthe present disclosure. However, those variations and modifications donot depart from the scope of the present disclosure. In someembodiments, two or more of the modules of the processing device 112 maybe combined into a single module. For example, the receiving module 402and the transmission module 408 may be combined into a single module andconfigured to perform the functions thereof. As another example, theprocessing device 112 may include one or more additional modules. Forexample, the processing device 112 may include a storage module (notshown) configured to store data generated during the smart dialoguecommunication (e.g., dialogue information).

FIG. 5 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure. At least a portion of process 500 may be implemented on thecomputing device 200 as illustrated in FIG. 2 or the mobile device 300as illustrated in FIG. 3. In some embodiments, one or more operations ofthe process 500 may be implemented in the smart dialogue communicationsystem 100 as illustrated in FIG. 1. In some embodiments, one or moreoperations in the process 500 may be stored in the storage device 150and/or the storage (e.g., the ROM 230, the RAM 240, etc.) as a form ofinstructions, and invoked and/or executed by the server 110 (e.g., theprocessing device 112 in the server 110 illustrated in FIG. 1, theprocessor 220 illustrated in FIG. 2, the CPU 340 illustrated in FIG. 3,or one or more modules in the processing device 112 illustrated in FIG.4). In some embodiments, the instructions may be transmitted in a formof electronic current or electrical signals. The operations of theillustrated process presented below are intended to be illustrative. Insome embodiments, the process 500 may be accomplished with one or moreadditional operations not described, and/or without one or more of theoperations discussed. Additionally, the order in which the operations ofthe process 500 as illustrated in FIG. 5 and described below is notintended to be limiting. In some embodiments, the process 500 may beimplemented on a smart dialogue communication platform.

In 501, the processing device 112 (e.g., the receiving module 402) mayreceive a dialogue request from a responder terminal device (e.g., theresponder terminal device 140). The dialogue request may refer to arequest for smart dialogue communication (e.g., with a smart dialoguecommunication platform). The dialogue request may be configured torequest a smart dialogue communication. In some embodiments, thedialogue request may be associated with an incoming call request. Theincoming call request may be a request (or an incoming call) for havinga conversation with a user of the responder terminal device. In someembodiments, the incoming call request may include informationassociated with the incoming call, for example, an incoming call time,who initiates the incoming call (or an incoming call number), etc. Insome embodiments, the dialogue request may include informationassociated with the incoming call request and/or the incoming call, forexample, an incoming call time, an incoming call number, a wait time forthe incoming call request, whether the user of the responder terminaldevice responds to the incoming call request, etc. In some embodiments,the incoming call request may be initiated by a requester via arequester terminal device (e.g., the requester terminal device 130),and/or further be received by the responder terminal device. Theincoming call request may be associated with the requester terminaldevice that initiates the incoming call request. The incoming callrequest may also be associated with the responder terminal device thatreceives the incoming call request. In some embodiments, the responderterminal device may receive the incoming call request from the requesterterminal device and/or determine whether the incoming call requestsatisfies a smart dialogue condition. In some embodiments, the incomingcall request may satisfy the smart dialogue condition, and the responderterminal device may generate the dialogue request based on the incomingcall request. More descriptions regarding the smart dialogue conditionmay be found elsewhere in the present disclosure (e.g., FIG. 18 and thedescriptions thereof).

In 503, the processing device 112 (e.g., the smart dialoguecommunication module 404) may perform the smart dialogue communicationwith the requester terminal device (e.g., the requester terminal device130) associated with the requester. In some embodiments, the smartdialogue communication (also referred to as human-computer interaction)may refer to the communication between human (e.g., the requester) and amachine (e.g., a smart dialogue communication robot). In someembodiments, the smart dialogue communication may be performed betweenthe smart dialogue communication platform and the requester terminaldevice. More descriptions regarding the smart dialogue communication maybe found elsewhere in the present disclosure (e.g., FIG. 6 and thedescriptions thereof).

In 505, the processing device 112 (e.g., the dialogue informationgeneration module 406) may generate dialogue information associated withthe smart dialogue communication. In some embodiments, the dialogueinformation may include voice information, text information, etc. Forexample, the dialogue information may include voice information relatingto or generated during the smart dialogue communication, such asincoming voice messages (e.g., voice messages from the requesterterminal device), reply voice messages (e.g., voice messages generatedby the smart dialogue communication platform, or voice messages obtainedfrom the responder terminal device, or the like). As another example,the dialogue information may include text information relating to orgenerated during the smart dialogue communication, such as incoming textmessages (e.g., text messages corresponding to the incoming voicemessages, or text messages obtained from the requester terminal device,or the like), reply text messages (e.g., text messages corresponding tothe reply voice messages, or text messages obtained from the responderterminal device, or the like). In some embodiments, the dialogueinformation (e.g., voice information or text information) may includequestion information and/or answer information relating to or generatedduring the smart dialogue communication. In some embodiments, thequestion information and/or answer information may be determined,identified, or generated by the smart dialogue communication platform.In some embodiments, the dialogue information may include scheduleinformation, information relating to emotion type of the requester,identity information of the requester, etc. In some embodiments, theschedule information, the information relating to emotion type of therequester, and/or the identity information of the requester may bedetermined, identified, or generated according to the dialogueinformation. The schedule information may be a daily (or weekly, ormonthly) schedule, for example, things to be done, when, where and/orhow the things are to be done. In some embodiments, the scheduleinformation may include a to-do-list. The emotion type of the requestermay include an emotion of the requester (e.g., happy, sad, calm,anxious, etc.), a character of the requester (e.g., extroversion,introversion, etc.). The identity information of the requester may referto an identity of the requester (e.g., a salesman, an estate agent, adeliveryman, etc.). In some embodiments, the identity of the requestermay relate to an occupation of the requester, a field of work of therequester, a title of the requester, a company the requester works for,etc. Exemplary schedule information may be determined according toprocess 1200 illustrated in FIG. 12. Exemplary emotion type of therequester may be determined according to process 1500 illustrated inFIG. 15. Exemplary identity information of the requester may bedetermined according to process 2100 illustrated in FIG. 21.

In some embodiments, the processing device 112 may directly record thetext information and/or the voice information generated during the smartdialogue communication to generate the dialogue information. In someembodiments, the processing device 112 may add a tag to each textmessage (e.g., an incoming text message, a text message generated by thesmart dialogue communication platform, a text message inputted by theresponder, etc.) or each voice message (e.g., an incoming voice message,a reply voice message, etc.) to generate the dialogue information. Thetag may indicate an initiator of each text message or each voice message(e.g., the requester, the smart dialogue communication platform, theresponder). In some embodiments, the processing device 112 may process(e.g., perform a semantic analysis, a feature extraction, etc.) the textinformation and/or the voice information generated during the smartdialogue communication, and/or generate the dialogue information basedon the processed information. In some embodiments, the processing device112 may generate the dialogue information in real time during the smartdialogue communication. In some embodiments, the processing device 112may record information associated with the smart dialogue communication,and generate the dialogue information after the smart dialoguecommunication is terminated.

In 507, the processing device 112 (e.g., the transmission module 408)may transmit the dialogue information to the responder terminal device(e.g., the responder terminal device 140). In some embodiments, theprocessing device 112 may transmit the dialogue information to theresponder terminal device in real-time, so that a user of the responderterminal device can check the dialogue information and/or intervene thesmart dialogue communication timely. For example, when the responderterminal device obtains the schedule information, the user of theresponder terminal device may decide whether to intervene the smartdialogue communication based on the schedule information. As anotherexample, when the responder terminal device obtains the emotion type ofthe requester, the user of the responder terminal device may decidewhether to intervene the smart dialogue communication based on theemotion type of the requester. Alternatively, the processing device 112may transmit the dialogue information to the responder terminal deviceintermittently. For example, the processing device 112 may transmit thedialogue information to the responder terminal device at intervals(e.g., every minute, every five minutes, every ten minutes, etc.). Asanother example, the processing device 112 may transmit the dialogueinformation to the responder terminal device when the smart dialoguecommunication is finished. In some embodiments, the processing device112 may transmit the dialogue information to the responder terminaldevice automatically. The user can check the dialogue information viathe responder terminal device at the user's convenience. Alternatively,the processing device 112 may transmit the dialogue information to theresponder terminal device when receiving an instruction (e.g., a checkinstruction) from the responder terminal device.

It should be noted that the above description regarding the process 500is merely provided for the purposes of illustration, and not intended tolimit the scope of the present disclosure. For persons having ordinaryskills in the art, multiple variations and modifications may be madeunder the teachings of the present disclosure. However, those variationsand modifications do not depart from the scope of the presentdisclosure. In some embodiments, a storing operation may be added. Thestoring operation may store the dialogue information and/or otherinformation generated in the process 500 in a storage device (e.g., thestorage device 150). In some embodiments, one or more operations inprocesses 800, 900, 1200, 1500, 1800, and/or 2100 may be added in theprocess 500.

FIG. 6 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure. The smart dialogue communication may be performed between asmart dialogue communication platform and a requester terminal device(e.g., the requester terminal device 130). In some embodiments, thesmart dialogue communication platform may be integrated in or be part ofa responder terminal device (e.g., the responder terminal device 140).In such cases, when an incoming call request is received or a dialoguerequest is generated, the smart dialogue communication may be initiatedand/or performed between the requester terminal device and the responderterminal device. Alternatively, the smart dialogue communicationplatform may be installed, disposed or implemented on a backend system(e.g., a call center of a mobile communication operator). In such cases,if a received incoming call request satisfies a smart dialogue condition(e.g., whether an incoming call number associated with the incoming callrequest exists in a contact list of the responder terminal device), thesmart dialogue communication may be initiated and/or performed betweenthe requester terminal device and the smart dialogue communicationplatform.

At least a portion of process 600 may be implemented on the computingdevice 200 as illustrated in FIG. 2 or the mobile device 300 asillustrated in FIG. 3. In some embodiments, one or more operations ofthe process 600 may be implemented in the smart dialogue communicationsystem 100 as illustrated in FIG. 1. In some embodiments, one or moreoperations in the process 600 may be stored in the storage device 150and/or the storage (e.g., the ROM 230, the RAM 240, etc.) as a form ofinstructions, and invoked and/or executed by the server 110 (e.g., theprocessing device 112 in the server 110 illustrated in FIG. 1, theprocessor 220 illustrated in FIG. 2, the CPU 340 illustrated in FIG. 3,or one or more modules in the processing device 112 illustrated in FIG.4). In some embodiments, the instructions may be transmitted in a formof electronic current or electrical signals. The operations of theillustrated process presented below are intended to be illustrative. Insome embodiments, the process 600 may be accomplished with one or moreadditional operations not described, and/or without one or more of theoperations discussed. Additionally, the order in which the operations ofthe process 600 as illustrated in FIG. 6 and described below is notintended to be limiting. In some embodiments, operation 503 illustratedin FIG. 5 may be performed according to the process 600.

In 601, the processing device 112 (e.g., the smart dialoguecommunication module 404) may obtain at least one incoming voice messagebased on the dialogue request. In some embodiments, the dialogue requestmay be associated with an incoming call request initiated by a requestervia a requester terminal device (e.g., the requester terminal device130). In some embodiments, the incoming voice message may include voicemessage(s) of the requester. In some embodiments, the processing device112 may obtain the at least one incoming voice message from therequester terminal device via a network (e.g., the network 120). In someembodiments, the requester may speak to the requester terminal device,and the voice of the requester may be converted into the incoming voicemessage by the requester terminal device. In some embodiments, theprocessing device 112 may obtain the at least one incoming voice messagein real-time or intermittently. For example, the processing device 112may obtain the incoming voice message once the incoming voice message isgenerated. As another example, the processing device 112 may obtain theincoming voice message when the requester finishes speaking to therequester terminal device.

In 603, the processing device 112 (e.g., the smart dialoguecommunication module 404) may convert the at least one incoming voicemessage into at least one incoming text message. In some embodiments,the processing device 112 may convert the incoming voice message intothe incoming text message in real-time or intermittently. For example,the processing device 112 may convert the incoming voice message intothe incoming text message when the incoming voice message is obtained.As another example, the processing device 112 may convert the incomingvoice message into the incoming text message when the processing device112 receives an instruction (e.g., text conversion instruction) from theresponder terminal device.

In some embodiments, the processing device 112 may convert the at leastone incoming voice message into the at least one incoming text messagebased on a speech recognition technique (also referred to as a speech totext technique). The speech recognition technique may be a hightechnology that allows machines to convert speech signals (or voicesignals) into corresponding text or commands through the process ofrecognition and understanding. Speech recognition technique may includeone or more operations, e.g., feature extraction, pattern matching,and/or model training. The speech recognition technique may use orinclude a feature parameter matching algorithm, a hidden Markov model(HMM) algorithm, an artificial neural network (ANN) algorithm, or thelike, or any combination thereof. In some embodiments, the processingdevice 112 may convert the incoming voice message into the incoming textmessage using a speech recognition model, e.g., a Gaussian mixture modeland hidden Markov model (GMM-HMM), a PyTorch-Kaldi speech recognitiontoolkit, etc. In some embodiments, the incoming voice message may beinput into the speech recognition model, and the speech recognitionmodel may convert the incoming voice message into the incoming textmessage and output the incoming text message. Merely by way of example,the speech recognition model may include an acoustic model and/or alanguage model. The acoustic model may be configured to determine afeature vector of the incoming voice message and determine one or morescores corresponding to one or more acoustic contents according toacoustic characteristics. The acoustic model may determine the acousticcontent with the highest score and input the acoustic content with thehighest score into the language model. The language model may beconfigured to determine one or more probabilities that the acousticcontent corresponds to one or more text sequences according to alinguistic-related theory and determine the acoustic content with thehighest probability as the incoming text message.

In 605, the processing device 112 (e.g., the smart dialoguecommunication module 404) may determine text features of the at leastone incoming text message. In some embodiments, the text features may beconfigured to express text information with a vector, which may bereferred to as text representation. In some embodiments, the textrepresentation may include discrete representation (e.g., a one-hotrepresentation, a multi-hot representation), distributed representation(e.g., a matrix-based representation, a dimension reduction-basedrepresentation, a cluster-based representation), neural network-basedrepresentation (e.g., continuous bag of words (CBOW), a neural networklanguage model (NNLM), a skip-gram model). In some embodiments, theprocessing device 112 may perform feature extraction on the at least oneincoming text message to determine the text features. In someembodiments, the processing device 112 may process the at least oneincoming text message by performing word segmentation and/or annotationon the at least one incoming text message. In some embodiments, theprocessing device 112 may perform the word segmentation and/orannotation on the at least one incoming text message according to one ormore word segmentation algorithms. Exemplary word segmentationalgorithms may include a character matching-based word segmentationalgorithm (e.g., a maximum matching algorithm, an omi-word segmentationalgorithm, a statistical language model algorithm, etc.), a sequenceannotation-based word segmentation algorithm (e.g., POS tagging), a deeplearning-based word segmentation algorithm (e.g., a hidden Markov modelalgorithm), or the like, or any combination thereof. In someembodiments, the processing device 112 may process the incoming textmessage using a THU lexical analyzer for Chinese (THULAC) toolkit. Insome embodiments, the processing device 112 may determine the textfeatures of the incoming text message in real-time or intermittently.For example, the processing device 112 may determine the text featureswhen the incoming text message is determined. As another example, theprocessing device 112 may determine the text features when theprocessing device 112 receives an instruction (e.g., feature extractioninstruction) from the responder terminal device.

In some embodiments, the processing device 112 may determine the textfeatures by performing feature extraction on the at least one processedincoming text message using a language model. The incoming text messagemay be input into the language model, and the language model maygenerate the text features of the incoming text message and output thetext features. Exemplary language models may include a generalizedautoregressive pretraining for language understanding (XLNet) model, astatistical language model (e.g., N-gram), a neural network languagemodel, a recurrent neural network language model, a neural probabilisticlanguage model etc. In some embodiments, the language model may begenerated by training a preliminary model using a plurality of trainingsamples. The training process may be performed by the processing device112 or any other processing device (e.g., an external processing deviceof the smart dialogue communication system 100). The plurality oftraining samples may include a plurality of sample text messages andsample text features of the sample text messages. The preliminary modelmay include a plurality of parameters that need to be adjusted. In someembodiments, the plurality of sample text messages may be inputted intothe preliminary model to determine actual output(s). The sample textfeatures of the sample text messages may be determined as desiredoutput(s). The processing device 112 or any other processing device(e.g., an external processing device of the smart dialogue communicationsystem 100) may compare the actual output(s) with the desired output(s)to determine loss function value(s). The loss function value(s) maymeasure difference(s) between the actual output(s) and the desiredoutput(s). In the training process of the preliminary model, theplurality of parameters may be adjusted to minimize the loss functionvalue(s). In some embodiments, the loss function value(s) and/or theparameters may be updated iteratively in order to obtain minimized lossfunction value(s). After the loss function value(s) are minimized, atrained language model may be determined according to the adjustedparameters.

In 607, the processing device 112 (e.g., the smart dialoguecommunication module 404) may determine question information associatedwith the text features by matching the text features in a questionknowledge database. In some embodiments, the question knowledge databasemay store a plurality of questions (also referred to as candidatequestions) or information relating to the questions. In someembodiments, the question knowledge database may be stored in a storagedevice (e.g., the storage device 150, the ROM 230, the RAM 240, the disk270, the storage 390, etc.) associated with the smart dialoguecommunication platform. Alternatively, the question knowledge databasemay be stored in an external storage device, and the processing device112 may retrieve the question knowledge database via a network (e.g.,the network 120). The question knowledge database may be preset by thesmart dialogue communication system 100, or self-defined by a user ofthe responder terminal device 140. In some embodiments, the processingdevice 112 may match the text features in the question knowledgedatabase using a first semantic matching model (e.g., a deep structuredsemantic model (DSSM), a long short term memory deep structured semanticmodel (LSTM-DSSM), an enhanced LSTM for natural language inference(ESIM), an attention-based convolutional neural network for modelingsentence pairs (ABCNN), etc.). For example, the processing device 112may input the text features and/or the candidate questions within thequestion knowledge database into the first semantic matching model. Thefirst semantic matching model may generate and/or output a matchingdegree between the text features and a (e.g., each) candidate question.In some embodiments, the processing device 112 may determine thequestion information based on the matching degree(s). For example, theprocessing device 112 may determine the candidate question having thehighest matching degree as the question information.

In 609, the processing device 112 (e.g., the smart dialoguecommunication module 404) may obtain answer information corresponding tothe question information by matching the question information in ananswer knowledge database. In some embodiments, the answer knowledgedatabase may store a plurality of answers (also referred to as candidateanswers) or information relating to the answers. In some embodiments,the answer knowledge database may be stored in a storage device (e.g.,the storage device 150, the ROM 230, the RAM 240, the disk 270, thestorage 390, etc.) associated with the smart dialogue communicationplatform. Alternatively, the answer knowledge database may be stored inan external storage device, and the processing device 112 may retrievethe answer knowledge database via a network (e.g., the network 120). Theanswer knowledge database may be preset by the smart dialoguecommunication system 100, or self-defined by a user of the responderterminal device 140. In some embodiments, there may be a one-to-onecorrespondence between the answers in the answer knowledge database andthe questions in the question knowledge database. For example, eachquestion may correspond to an answer. In some embodiments, there may bea one-to-more correspondence between the answers in the answer knowledgedatabase and the questions in the question knowledge database. Forexample, one or more questions may correspond to an answer. Forinstance, questions such as “who are you?” “Are you XXX?” may correspondto the same answer such as “I'm a smart assistant of XXX.” In someembodiments, there may be a more-to-one correspondence between theanswers in the answer knowledge database and the questions in thequestion knowledge database. For example, a question may correspond toone or more answers. For instance, question such as “Can I speak toXXX?” may correspond to answers such as “XXX is busy now, would you mindleaving a message?” “I'm a smart assistance of XXX, you can leave amessage.” The processing device 112 may match the question informationin the answer knowledge database using a second semantic matching model(e.g., a deep structured semantic model (DSSM), a long short term memorydeep structured semantic model (LSTM-DSSM), an enhanced LSTM for naturallanguage inference (ESIM), an attention-based convolutional neuralnetwork for modeling sentence pairs (ABCNN), etc.). For example, theprocessing device 112 may input the question information and/or thecandidate answers within the answer knowledge database into the semanticmatching model. The semantic matching model may generate and/or output amatching degree between the question information and a (e.g., each)candidate answer. In some embodiments, the processing device 112 maydetermine the answer information based on the matching degree(s). Forexample, the processing device 120 may determine the candidate answerhaving the highest matching degree as the answer information.

In some embodiments, the second semantic matching model may be same asor similar to the first semantic matching model as described inoperation 607. For example, the first semantic matching model and thesecond semantic matching model may both be a deep structured semanticmodel (DSSM). Alternatively, the second semantic matching model may bedifferent from the first semantic matching model. For example, the firstsemantic matching model may be a deep structured semantic model (DSSM),and the second semantic matching model may be an attention-basedconvolutional neural network for modeling sentence pairs (ABCNN). Thefirst semantic matching model may be used to determine a matching degreebetween the text features and a (e.g., each) candidate question. Theinput of the first semantic matching model may include the text featuresand the candidate questions. The output of the first semantic matchingmodel may include a matching degree between the text features and a(e.g., each) candidate question. The second semantic matching model maybe used to determine a matching degree between the question informationand a (e.g., each) candidate answer. The input of the second semanticmatching model may include the question information and the candidateanswers. The output of the second semantic matching model may include amatching degree between the question information and a (e.g., each)candidate answer.

In some embodiments, the question knowledge database and the answerknowledge database may be integrated into a single knowledge database.In some embodiments, the integrated knowledge database may include aplurality of question-answer pairs. A question-answer pair may include acandidate question in the question knowledge database and acorresponding candidate answer in the answer knowledge database. In someembodiments, the processing device 112 may directly determine the answerinformation by retrieving or matching the question information with thequestion-answer pairs in the integrated knowledge database. In someembodiments, the processing device 112 may directly determine the answerinformation by retrieving or matching the text features with thequestion-answer pairs in the integrated knowledge database, and thus,operation 607 may be omitted.

In 611, the processing device 112 (e.g., the smart dialoguecommunication module 404) may convert the answer information into atleast one reply voice message. In some embodiments, the processingdevice 112 may convert the answer information into the reply voicemessage using a text to speech technique (e.g., a speech synthesismodel). Exemplary speech synthesis models may include a generative modelfor raw audio (e.g., WaeNet), a parallel wave generation in end-to-endtext-to-speech (e.g., ClariNet), a parallel neural text-to-speech (e.g.,ParaNet), or the like. In some embodiments, the answer information maybe input into the speech synthesis model, the speech synthesis model maygenerate and/or output the reply voice message.

In 613, the processing device 112 (e.g., the smart dialoguecommunication module 404) may transmit the at least one reply voicemessage to the requester terminal device (e.g., the requester terminaldevice 130). In some embodiments, the processing device 112 may transmitthe at least one reply voice message to the requester terminal devicevia the network (e.g., the network 120). In some embodiments, theprocessing device 112 may transmit the at least one reply voice messageto the requester terminal device using a full-duplex voice interactiontechnique, which may effectively control the rhythm of the dialogue andimplement real-time recovery once the dialogue is interrupted. In someembodiments, after the reply voice message is received by the requesterterminal device, the requester terminal device may play the reply voicemessage to the requester. In some embodiments, the requester of therequester terminal device may respond to the reply voice message, e.g.,continuing the smart dialogue communication with the smart dialoguecommunication platform, finishing or terminating the smart dialoguecommunication, etc. In some embodiments, if the smart dialoguecommunication is interrupted for a threshold time period, and/or therequester terminal device does not respond to the smart dialoguecommunication for a threshold time period, the smart dialoguecommunication platform may finish or terminate the smart dialoguecommunication, and/or restart the smart dialogue communication.

It should be noted that the above description regarding the process 600is merely provided for the purposes of illustration, and not intended tolimit the scope of the present disclosure. For persons having ordinaryskills in the art, multiple variations and modifications may be madeunder the teachings of the present disclosure. However, those variationsand modifications do not depart from the scope of the presentdisclosure. For example, if the requester of the requester terminaldevice decides to continue the smart dialogue communication, theprocessing device 112 may repeat the process 600 until the smartdialogue communication is finished or terminated. As another example,the processing device 112 may generate a question-answer record based onthe at least one incoming text message, the question information, andthe answer information. In some embodiments, the processing device 112may update the question knowledge database and/or the answer knowledgedatabase based on the question-answer record. In some embodiments, theanswer knowledge database may include general answers (e.g., “OK, Iknow”, “What else can I do for you”). If an incoming text messagematches with no question information (in the question knowledgedatabase) and/or no answer information (in the answer knowledgedatabase), the processing device 112 may generate the at least one replyvoice message based on the general answer(s). In some embodiments,before operation 601, the smart dialogue communication platform may senda prompt information to the requester terminal device to remind therequester that a smart dialogue communication is initiated.

FIG. 7 is a schematic diagram illustrating an exemplary system for smartdialogue communication according to some embodiments of the presentdisclosure. It should be noted that the functions of the system areprovided for illustration purposes, and are not intended to limit thescope of the present disclosure. In some embodiments, the system may beimplemented as a smart dialogue communication platform.

As shown in FIG. 7, the smart dialogue communication platform (e.g., asmart dialogue communication robot) may receive at least one voicemessage 701 from a requester terminal device (e.g., the requesterterminal device 130) using a full-duplex voice interaction technique723. The at least one voice message 701 may be converted into at leastone incoming text message 705 using a speech recognition model 703(e.g., a PyTorch-Kaldi speech recognition toolkit). The smart dialoguecommunication platform may perform feature extraction 707 (e.g., wordsegmentation, annotation) on the at least one incoming text message 705to determine text features 709. The smart dialogue communicationplatform may determine question information 711 associated with the textfeatures 709 by matching the text features in a question knowledgedatabase 713. The smart dialogue communication platform may furtherobtain answer information 713 corresponding to the question information715 by matching the question information 715 in an answer knowledgedatabase 717. The smart dialogue communication platform may convert theanswer information 715 into at least one reply voice message 719 using atext-to-speech technique. The smart dialogue communication platform maythen transmit the at least one reply voice message 719 to the requesterterminal device using the full-duplex voice interaction technique 723.In some embodiments, questions in the question knowledge database andanswers in the answer knowledge database may form a plurality ofquestion-answer pairs, which may be stored in an integrated knowledgedatabase 721. The knowledge database 721 (e.g., including the questionknowledge database 713 and the answer knowledge database 717) may beupdated based on the text message, the question information, and/or theanswer information. More descriptions regarding the smart dialoguecommunication may be found elsewhere in the present disclosure (e.g.,FIG. 6 and the descriptions thereof).

FIG. 8 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure. At least a portion of process 800 may be implemented on thecomputing device 200 as illustrated in FIG. 2 or the mobile device 300as illustrated in FIG. 3. In some embodiments, one or more operations ofthe process 800 may be implemented in the smart dialogue communicationsystem 100 as illustrated in FIG. 1. In some embodiments, one or moreoperations in the process 800 may be stored in the storage device 150and/or the storage (e.g., the ROM 230, the RAM 240, etc.) as a form ofinstructions, and invoked and/or executed by the server 110 (e.g., theprocessing device 112 in the server 110 illustrated in FIG. 1, theprocessor 220 illustrated in FIG. 2, the CPU 340 illustrated in FIG. 3,or one or more modules in the processing device 112 illustrated in FIG.4). In some embodiments, the instructions may be transmitted in a formof electronic current or electrical signals. The operations of theillustrated process presented below are intended to be illustrative. Insome embodiments, the process 800 may be accomplished with one or moreadditional operations not described, and/or without one or more of theoperations discussed. Additionally, the order in which the operations ofthe process 800 as illustrated in FIG. 8 and described below is notintended to be limiting.

In 801, the processing device 112 (e.g., the smart dialoguecommunication module 404) may perform a smart dialogue communicationwith a requester terminal device (e.g., the requester terminal device130) associated with a requester based on a dialogue request. In someembodiments, the dialogue request may be associated with an incomingcall request. In some embodiments, the incoming call request may beinitiated by the requester via the requester terminal device. In someembodiments, a responder terminal device may receive the incoming callrequest from the requester terminal device and/or determine whether theincoming call request satisfies a smart dialogue condition. In someembodiments, the incoming call request may satisfy the smart dialoguecondition, and the responder terminal device may generate the dialoguerequest based on the incoming call request. In some embodiments, theresponder terminal device may transmit the dialogue request to the smartdialogue communication platform, and the processing device 112 mayperform the smart dialogue communication based on the dialogue request.In some embodiments, the processing device 112 may automatically performthe smart dialogue communication based on the dialogue request.

In some embodiments, before the smart dialogue communication isperformed, the processing device 112 may determine whether an answerinstruction corresponding to the incoming call request is received fromthe responder terminal device. In some embodiments, when an incomingcall request is received by the responder terminal device, informationrelating to the incoming call (e.g., the incoming call number, theidentity of the requester, etc.) may be displayed on the display deviceof the responder terminal device. A user of the responder terminaldevice may decide whether to respond to the incoming call request. Ifthe user decides to respond to the incoming call request, the responderterminal device may generate an answer instruction and/or send theanswer instruction to the smart dialogue communication platform, theresponder terminal device may not generate the dialogue request, andaccordingly, the processing device 112 may not perform the smartdialogue communication. If the user decides not to respond to theincoming call request, the responder terminal device may not generatethe answer instruction, or the responder terminal may directly refusethe incoming call request. In some embodiments, the processing device112 may determine whether the answer instruction is received within apredetermined time period. The predetermined time period may be adefault value set by the smart dialogue communication system 100, or maybe preset or adjusted by the user of the responder terminal device. Insome embodiments, in response to determining that the answer instructionis not received within the predetermined time period (e.g., the userdecides not to respond to the incoming call request), the processingdevice 112 may perform the smart dialogue communication, therebyeffectively avoid information loss (e.g., avoid missing incoming callrequest(s)).

The smart dialogue communication may be performed between the smartdialogue communication platform and the requester terminal device. Moredescriptions regarding the smart dialogue communication may be foundelsewhere in the present disclosure (e.g., FIG. 6 and the descriptionsthereof). In some embodiments, the dialogue request may be associatedwith an incoming call request initiated by the requester via therequester terminal device. The dialogue request may satisfy a smartdialogue condition determined by the responder terminal device. Thesmart dialogue condition may be preset by the responder terminal deviceor any other component of the smart dialogue communication system 100.More descriptions of the smart dialogue condition may be found elsewherein the present disclosure (e.g., FIG. 18 and the relevant descriptionsthereof).

In 803, the processing device 112 (e.g., the dialogue informationgeneration module 406) may generate and/or obtain real-time dialogueinformation associated with the smart dialogue communication. In someembodiments, the dialogue information may include incoming voicemessages (e.g., voice messages from the requester) and/or reply voicemessages (e.g., voice messages generated by the smart dialoguecommunication platform). The processing device 112 may record theincoming voice messages and the reply voice messages in real-time andgenerate the real-time dialogue information. In some embodiments, thedialogue information may include text information. For example, theprocessing device 112 may record the incoming text message(s), questioninformation associated with the incoming text message(s), and/or answerinformation corresponding to the question information, and/or generatethe real-time dialogue information. More descriptions of the dialogueinformation, question information, and answer information may be foundelsewhere in the present disclosure (e.g., FIGS. 5-6 and descriptionsthereof).

In 805, the processing device 112 (e.g., the transmission module 408)may transmit the real-time dialogue information to a responder terminaldevice (e.g., the responder terminal device 140). In some embodiments,the dialogue information may include voice information. In someembodiments, the processing device 112 may convert the voice informationinto text information. For example, the processing device 112 mayconvert the voice information into text information based on a speechrecognition technique (also referred to as a speech to text technique).The speech recognition technique may use or include a feature parametermatching algorithm, a hidden Markov model (HMM) algorithm, an artificialneural network (ANN) algorithm, or the like, or any combination thereof.In some embodiments, the processing device 112 may convert the voiceinformation into the text information using a speech recognition model,e.g., a Gaussian mixture model and hidden Markov model (GMM-HMM), aPyTorch-Kaldi speech recognition toolkit, etc. The voice information maybe input into the speech recognition model, and the speech recognitionmodel may generate and/or output the text information. The processingdevice 112 may transmit the text information to the responder terminaldevice, and/or accordingly, the responder terminal device may displaythe text information on a display device of the responder terminaldevice. Alternatively, the processing device 112 may transmit the voiceinformation to the responder terminal device, and/or accordingly, theresponder terminal device may broadcast the voice information via avoice player of the responder terminal device. In some embodiments, theprocessing device 112 may transmit the dialogue information to theresponder terminal device 140 in real-time, or intermittently.

In 807, the processing device 112 (e.g., the receiving module 402) maydetermine whether prompt information is received. The prompt informationmay be associated with a reply instruction provided by the responderterminal device based on the real-time dialogue information.

In some embodiments, when or after the responder terminal devicereceives the real-time dialogue information, a user of the responderterminal device may decide whether to intervene in the smart dialoguecommunication. If the user intervenes in the smart dialoguecommunication, the user may send, via the responder terminal device,prompt information to the smart dialogue communication platform.Alternatively, if the user decides not to intervene in the smartdialogue communication, the user may not send, via the responderterminal device, prompt information to the smart dialogue communicationplatform. In some embodiments, the prompt information may refer toinformation provided by the user via the responder terminal device,and/or be used to intervene in the smart dialogue communication platform(e.g., used to replace the answer information determined by the smartdialogue communication platform, or used to respond to the requesterterminal device, etc.). In some embodiments, the prompt information mayinclude text information, voice information, expression information(e.g., emoji), body language information, or the like, or anycombination thereof.

In some embodiments, in response to determining that the promptinformation is not received, the processing device 112 may proceed tooperation 803, or the processing device 112 may continue performing thesmart dialogue communication. In 803, the processing device 112 maycontinue obtaining the real-time dialogue information associated withthe smart dialogue communication. Alternatively, in response todetermining that the prompt information is received, the processingdevice 112 may proceed to operation 809. In 809, the processing device112 (e.g., the smart dialogue communication module 404) may convert theprompt information into a reply voice message. In some embodiments, theprocessing device 112 (e.g., the transmission module 408) may transmitthe reply voice message to the requester terminal device. In someembodiments, the processing device 112 may convert the promptinformation into the reply voice message based on a text to speechtechnique (e.g., a speech synthesis model). The speech synthesis modelmay include but is not limited to a generative model for raw audio(e.g., WaeNet), a parallel wave generation in end-to-end text-to-speech(e.g., ClariNet), a parallel neural text-to-speech (e.g., ParaNet), orthe like. In some embodiments, the prompt information may be input intothe speech synthesis model, and the speech synthesis model may generateand/or output the reply voice message. In some embodiments, theprocessing device 112 may transmit the reply voice message to therequester terminal device using a full-duplex voice interactiontechnique.

In some embodiments of the present disclosure, the smart dialoguecommunication platform may transmit the real-time dialogue informationto the responder terminal device, so that the user of the responderterminal device can know or obtain the dialogue information inreal-time. Besides, the user can intervene in the smart dialoguecommunication by sending prompt information to the smart dialoguecommunication platform, which may improve the efficiency of the smartdialogue communication, and improve user experiences.

It should be noted that the above description of the process 800 isprovided for the purposes of illustration, and is not intended to limitthe scope of the present disclosure. For persons having ordinary skillsin the art, multiple variations and modifications may be made under theteachings of the present disclosure. However, those variations andmodifications do not depart from the scope of the present disclosure. Insome embodiments, if the user decides to intervene in the smart dialoguecommunication, the user may send, via the responder terminal device, aninstruction to the smart dialogue communication platform to end thesmart dialogue communication, and may directly communicate, via theresponder terminal device, with the requester terminal device. In someembodiments, the processing device 112 may record the voice information(e.g., the voice information including the incoming voice message(s) andthe reply voice message(s)) associated with the smart dialoguecommunication, and generate a dialogue record based on the voiceinformation. The dialogue record may include information relating to adialogue process between the requester and the smart dialoguecommunication platform (and/or the user of the responder terminaldevice). For example, the dialogue record may include one or more firstsentences and/or words said by the requester, and/or one or more secondsentences and/or words provided by the dialogue communication platform(and/or the user of the responder terminal device). In some embodiments,the first sentences and/or words and the second sentences and/or wordsmay be presented sequentially according to their appearance in the smartdialogue communication. The dialogue record may be stored in a storagedevice (e.g., the storage device 150, the ROM 230, the RAM 240, the disk270, the storage 390, etc.) of the smart dialogue communicationplatform. In some embodiments, the processing device 112 may transmitthe dialogue record to the responder terminal device, so that the userof the responder terminal device can check the dialogue record inreal-time, or intermittently. Then, the user can deal with eventsinvolved in the dialogue record according to the importance of events,which may improve user experiences. In some embodiments, the dialoguerecord may be broadcasted via a voice player. Alternatively, thedialogue record may be displayed (e.g., in a form of list) on a displaydevice (e.g., a display device of the responder terminal device).

FIG. 9 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure. At least a portion of process 900 may be implemented on thecomputing device 200 as illustrated in FIG. 2 or the mobile device 300as illustrated in FIG. 3. In some embodiments, one or more operations ofthe process 900 may be implemented in the smart dialogue communicationsystem 100 as illustrated in FIG. 1. In some embodiments, one or moreoperations in the process 900 may be stored in the storage device 150and/or the storage (e.g., the ROM 230, the RAM 240, etc.) as a form ofinstructions, and invoked and/or executed by the server 110 (e.g., theprocessing device 112 in the server 110 illustrated in FIG. 1, theprocessor 220 illustrated in FIG. 2, the CPU 340 illustrated in FIG. 3,or one or more modules in the processing device 112 illustrated in FIG.4). In some embodiments, the instructions may be transmitted in a formof electronic current or electrical signals. The operations of theillustrated process presented below are intended to be illustrative. Insome embodiments, the process 900 may be accomplished with one or moreadditional operations not described, and/or without one or more of theoperations discussed. Additionally, the order in which the operations ofthe process 900 as illustrated in FIG. 9 and described below is notintended to be limiting. In some embodiments, one or more operations ofthe process 900 may be similar to operation(s) of the process 800.

In 901, a responder terminal device (e.g., the responder terminal device140) may receive an incoming call request initiated by a requester via arequester terminal device (e.g., the requester terminal device 130). In903, the responder terminal device may determine whether the incomingcall request satisfies a smart dialogue condition. In response todetermining that the incoming call request does not satisfy the incomingcall request, in 905, the responder terminal device may respond to theincoming call request. For example, the responder terminal device maydirectly communicate with the requester terminal device. As anotherexample, a user of the responder terminal device may refuse the incomingcall request (e.g., the user may directly hang up by clicking a hang-upbutton presented on the display device of the responder terminaldevice).

In response to determining that the incoming call request satisfies thesmart dialogue condition, the responder terminal device may generate adialogue request. The dialogue request may be configured to request asmart dialogue communication. The responder terminal device may transmitthe dialogue request to a smart dialogue communication platform. In 907,the smart dialogue communication platform (e.g., the processing device112) may perform the smart dialogue communication with the requesterterminal device associated with the requester. More descriptions of theincoming call request, the dialogue request, and the smart dialoguecommunication may be found elsewhere in the present disclosure (e.g.,FIGS. 5-6 and descriptions thereof).

In 909, the responder terminal device may receive real-time dialogueinformation associated with the smart dialogue communication. In 911,the responder terminal device may determine whether to intervene in thesmart dialogue communication. In response to determining that theresponder terminal device needs to intervene in the smart dialoguecommunication, in 913, the responder terminal device may generate promptinformation and/or transmit the prompt information to the smart dialoguecommunication platform. The smart dialogue communication platform mayconvert the prompt information into reply voice information and/ortransmit the reply voice information to the requester terminal device.More descriptions of the prompt information and the conversion of theprompt information may be found elsewhere in the present disclosure(e.g., FIG. 8 and descriptions thereof).

In response to determining that the responder terminal device does notintervene in the smart dialogue communication, in 915, the smartdialogue communication platform (e.g., the processing device 112) maycontinue performing the smart dialogue communication with the requesterterminal device associated with the requester. In 917, the smartdialogue communication platform (e.g., the processing device 112) maygenerate a dialogue record based on the smart dialogue communicationand/or transmit the dialogue record to the responder terminal device.

FIG. 10 is a schematic diagram illustrating an exemplary system forsmart dialogue communication according to some embodiments of thepresent disclosure. It should be noted that the system is provided forillustration purposes, and are not intended to limit the scope of thepresent disclosure. The system may include a requester terminal device,a responder terminal device, and a smart dialogue communication platform(e.g., a smart dialogue communication robot). As shown in FIG. 10, thecommunication between the requester terminal device, the responderterminal device, and/or the smart dialogue communication platform may beimplemented via the network (e.g., the network 120).

As shown in FIG. 10, a requester of the requester terminal device maysend an incoming call request 01 via the network. A user of theresponder terminal device may receive the incoming call request 02 viathe network. The responder terminal device may determine whether theincoming call request 02 satisfies a smart dialogue condition. If thesmart dialogue condition is satisfied, the responder terminal device maygenerate a dialogue request 03 configured to request a smart dialoguecommunication based on the incoming call request 02. The responderterminal device may transmit the dialogue request 03 to the smartdialogue communication platform via the network.

The smart dialogue communication platform may perform the smart dialoguecommunication with the requester terminal device via the network. Insome embodiments, the smart dialogue communication platform may transmita message (e.g., a voice message, a text message, etc.) to the requesterterminal device to inform the requester terminal device that the smartdialogue communication platform is answering the incoming call request.For example, the smart dialogue communication platform may transmit amessage such as “Hello, I'm a smart assistant, can I help you?” to therequester terminal device. The requester terminal device may transmitthe incoming voice message 04 to the smart dialogue communicationplatform via the network. The smart dialogue communication platform mayreceive the incoming voice message 04 via the network. The smartdialogue communication platform may generate reply voice message(s) 09based on a knowledge database (e.g., a question knowledge database,and/or an answer knowledge database, etc.) in response to the incomingvoice message 04. The smart dialogue communication platform may transmitthe reply voice message(s) 09 to the requester terminal device.

During the smart dialogue communication, the smart dialoguecommunication platform may generate and/or obtain real-time dialogueinformation 06, and/or transmit the real-time dialogue information 06 tothe responder terminal device. The user of the responder terminal devicecan check the real-time dialogue information 06 and decides whether tointervene in the smart dialogue communication (e.g., whether to generateprompt information 07). If the user of the responder terminal deviceintervenes in the smart dialogue communication, the responder terminaldevice may generate the prompt information 07, and the responderterminal device may transmit the prompt information 07 to the smartdialogue communication platform. The smart dialogue communicationplatform may convert the prompt information 07 into reply voicemessage(s) 09, and transmit the reply voice message(s) 09 to therequester terminal device.

FIG. 11 is a block diagram illustrating an exemplary device for smartdialogue communication according to some embodiments of the presentdisclosure. The device (also referred to as a smart dialoguecommunication device) 1100 may be implemented on the computing device(e.g., the processor 220 of the computing device 200) as illustrated inFIG. 2, and/or the CPU 340 as illustrated in FIG. 3. As shown in FIG.11, the device 1100 may include a smart dialogue communication module1102, an information transmission module 1104, an informationdetermination module 1106, and an information prompt module 1108. Insome embodiments, the smart dialogue communication module 1102, theinformation transmission module 1104, the information determinationmodule 1106, and the information prompt module 1108 may be implementedon a smart dialogue communication platform.

The smart dialogue communication module 1102 may be configured toperform a smart dialogue communication with a requester terminal deviceassociated with a requester based on a dialogue request, and obtainreal-time dialogue information associated with the smart dialoguecommunication. The information transmission module 1104 may beconfigured to transmit the real-time dialogue information to a responderterminal device. The information determination module 1106 may beconfigured to determine whether prompt information is received. Theprompt information may be associated with a reply instruction providedby the responder terminal device based on the real-time dialogueinformation.

The device 1100 in the present disclosure may be used to perform one ormore operations of process 800 as described in FIG. 8. Theimplementation principle and technical effects of the device 1100 may besimilar to the operations illustrated in the process 800. In someembodiments, the device 1100 may further include a display deviceconfigured to display the real-time dialogue information. In someembodiments, the device 1100 may be an exemplary embodiment of theprocessing device 112.

FIG. 12 is a flowchart illustrating an exemplary process for generatingschedule information according to some embodiments of the presentdisclosure. At least a portion of process 1200 may be implemented on thecomputing device 200 as illustrated in FIG. 2 or the mobile device 300as illustrated in FIG. 3. In some embodiments, one or more operations ofthe process 1200 may be implemented in the smart dialogue communicationsystem 100 as illustrated in FIG. 1. In some embodiments, one or moreoperations in the process 1200 may be stored in the storage device 150and/or the storage (e.g., the ROM 230, the RAM 240, etc.) as a form ofinstructions, and invoked and/or executed by the server 110 (e.g., theprocessing device 112 in the server 110 illustrated in FIG. 1, theprocessor 220 illustrated in FIG. 2, the CPU 340 illustrated in FIG. 3,or one or more modules in the processing device 112 illustrated in FIG.4). In some embodiments, the instructions may be transmitted in a formof electronic current or electrical signals. The operations of theillustrated process presented below are intended to be illustrative. Insome embodiments, the process 1200 may be accomplished with one or moreadditional operations not described, and/or without one or more of theoperations discussed. Additionally, the order in which the operations ofthe process 1200 as illustrated in FIG. 12 and described below is notintended to be limiting.

In 1201, the processing device 112 (e.g., the smart dialoguecommunication module 404) may perform a smart dialogue communicationwith a requester terminal device (e.g., the requester terminal device130) associated with a requester based on a dialogue request. Moredescriptions of the dialogue request and the performing of the smartdialogue communication may be found elsewhere in the present disclosure(e.g., FIGS. 5-6, and 8 and descriptions thereof).

In 1203, the processing device 112 (e.g., the smart dialoguecommunication module 404) may obtain incoming voice messages associatedwith the smart dialogue communication. The incoming voice messages mayinclude voice messages of the requester generated in the incoming callrequest and/or the smart dialogue communication. In some embodiments,the processing device 112 may obtain the incoming voice messages fromthe requester terminal device via a network (e.g., the network 120).

In 1205, the processing device 112 (e.g., the smart dialoguecommunication module 404) may convert the incoming voice messages intoincoming text messages. In some embodiments, the processing device 112may convert the incoming voice messages into the incoming text messagesbased on a speech recognition technique (also referred to as a speech totext technique). The speech recognition technique may use or include afeature parameter matching algorithm, a hidden Markov model (HMM)algorithm, an artificial neural network (ANN) algorithm, or the like, orany combination thereof. In some embodiments, the processing device 112may convert the incoming voice messages into the incoming text messagesusing a speech recognition model, e.g., a Gaussian mixture model andhidden Markov model (GMM-HMM), a PyTorch-Kaldi speech recognitiontoolkit, etc. The incoming voice messages may be input into the speechrecognition model, and the speech recognition model may generate and/oroutput the incoming text messages.

In 1207, the processing device 112 (e.g., the smart dialoguecommunication module 404) may determine event keywords by performingsemantic analysis on the incoming text messages. In some embodiments,the processing device 112 may process the incoming text messages byperforming word segmentation and/or annotation on the incoming textmessages. For example, the processing device 112 may process theincoming text messages using a THU lexical analyzer for Chinese (THULAC)toolkit. The processing device 112 may then determine text features byperforming feature extraction on the processed incoming text messagesusing a language model. Exemplary language models may include ageneralized autoregressive pretraining for language understanding(XLNet) model, a statistical language model (e.g., N-gram), a neuralnetwork language model, a recurrent neural network language model, aneural probabilistic language model etc. For example, the processingdevice 112 may determine the text features by performing featureextraction on the processed incoming text messages using an XLNet model.The processed incoming text messages may be input into the languagemodel, and the language model may generate and/or output the textfeatures. In some embodiments, the processing device 112 may determinethe event keywords by performing semantic analysis on the extractedinformation using a semantic analysis model. Exemplary semantic analysismodels may include a transition-based neural abstract syntax parser(TRANX).

In some embodiments, the smart dialogue communication may relate to anevent. For example, the requester may provide the user (or responder)with information relating to an event or request the user to deal withthe event during the smart dialogue communication. The event may relateto one or more things to deal with (e.g., participate a party, attend aconference, reserve a ticket, reserve a room, etc.). In someembodiments, the event keywords may include a name of the requester, anevent time, an event location, an event content, or the like, or anycombination thereof. The name of the requester may refer to the name ofthe requester who initiates the incoming call request. The event timemay refer to the time related to the event (e.g., the time of dealingwith the event, the occurrence time of the event, etc.). The eventlocation may refer to the location related to the event (e.g., thelocation of dealing with the event, the occurrence location of theevent, etc.). The event content may refer to descriptions or details ofthe event. It can be understood that the above descriptions of the eventkeywords are merely for illustration and are not intended to limit thescope of the present disclosure. In some embodiments, the event keywordsmay be defined or set by the user. For example, the event keywords mayinclude other information, such as incoming call time.

In some embodiments, to determine the event time, the processing device112 may obtain time information by performing the semantic analysis onthe incoming text messages. In some embodiments, the processing device112 may determine whether the time information includes an absolutetime. In some embodiments, the time information may include an absolutetime, e.g., eight o'clock (8:00 am). In some embodiments, the timeinformation may include a relative time, e.g., after eight hours. Inresponse to determining that the time information includes the absolutetime, the processing device 112 may designate the time information asthe event time. Alternatively, in response to determining that the timeinformation includes the relative time, the processing device 112 maydetermine the event time based on the time information and a systemtime. For example, if the time information is “after eight hours”, andthe system time is eight o'clock (8:00 am), then the event time may bedetermined as sixteen o'clock (4:00 pm).

In 1209, the processing device 112 (e.g., the smart dialoguecommunication module 404) may generate schedule information based on theevent keywords. The schedule information may include a daily (or weekly,or monthly) schedule of the user (e.g., the responder). The scheduleinformation may include one or more things to be done, such asparticipating a party, attending a conference, reserving a ticket,reserving a room, etc. In some embodiments, the schedule information maybe denoted by “the name of the requester: ***; the event time: ***; theevent location: ***; the event content: ***.” In some embodiments, theschedule information may be denoted by any other form. In someembodiments, the items and/or the format of the schedule information maybe preset by the smart dialogue communication system 100 or defined bythe user of the responder terminal device.

In 1211, the processing device 112 (e.g., the transmission module 408)may transmit the schedule information to a responder terminal device(e.g., the responder terminal device 140) associated with the incomingcall request for presentation (e.g., display). In some embodiments, theschedule information may be displayed on a display device of theresponder terminal device.

In some embodiments, the processing device 112 or the responder terminaldevice may determine an emergency level of the schedule informationaccording to a level classification rule. In some embodiments, theprocessing device 112 or the responder terminal device may determine areminder time of the schedule information based on the emergency level.In some embodiments, the processing device 112 may transmit the remindertime to the responder terminal device. In some embodiments, theresponder terminal device may generate a reminder at the reminder time.In some embodiments, the emergency level may include three levels: (1)urgent affairs (which need to be dealt with immediately), (2) generalaffairs (which can be dealt with after a time period), (3) harassingcalls (which need not to be dealt with). If the emergency level isdirected to urgent affairs, after the schedule information is receivedby the responder terminal device, the responder terminal device mayimmediately remind the user with an alarm. If the emergency level isdirected to general affairs, the responder terminal device may generatea reminder (e.g., send out an alarm). In some embodiments, the remindertime and/or the ways of the reminder may be defined or preset by theuser of the responder terminal device. In some embodiments, the remindermay be generated at the reminder time (e.g., a preset time before theevent time (e.g., a day before the event time, five hours before theevent time, one hour before the event time, etc.)). In some embodiments,if the emergency level is directed to harassing calls, the processingdevice 112 or the responder terminal device may not generate thereminder.

In some embodiments of the present disclosure, when the user is notconvenient to respond to the incoming call request, the smart dialoguecommunication platform (e.g., the processing device 112) may perform thesmart dialogue communication with the requester terminal device toobtain the incoming voice messages. The processing device 112 mayconvert the incoming voice messages into the incoming text messages, anddetermine the event keywords by performing semantic analysis on theincoming text messages. The processing device 112 may then generateschedule information based on the event keywords and/or transmit theschedule information to the responder terminal device associated withthe incoming call request for display. Therefore, the omission ofinformation and the missing of important events may be avoided, therebyimproving user experiences. In some embodiments, the processing device112 may store the schedule information instead of the voice information(or text information) generated during the smart dialogue communicationin a storage of the smart dialogue communication platform, or theresponder terminal device may store the schedule information instead ofthe voice information (or text information), thereby saving the storagespace of the storage device (e.g., the storage device 150, the ROM 230,the RAM 240, the disk 270, or the storage 390, or an external storagedevice associated with the smart dialogue communication system 100). Bygenerating the schedule information, the processing device 112 or theresponder terminal device may obtain or identify relatively moreeffective information based on the smart dialogue communication, therebyavoiding information loss (e.g., avoid missing important events),providing great convenience for incoming call affair processing, andimproving the user experiences.

It should be noted that the above description of the process 1200 isprovided for the purposes of illustration, and is not intended to limitthe scope of the present disclosure. For persons having ordinary skillsin the art, multiple variations and modifications may be made under theteachings of the present disclosure. However, those variations andmodifications do not depart from the scope of the present disclosure. Insome embodiments, before performing the smart dialogue communication,the responder terminal device may determine whether an incoming callnumber of the incoming call request exists in a contact list of theresponder terminal device. If the incoming call number exists in thecontact list, the responder terminal device may determine a name in thecontact list corresponding to the incoming call number as the name ofthe requester. If the incoming call number does not exist in the contactlist, the name of the requester may be determined based on the smartdialogue communication as described in operation 1207.

FIG. 13 is a schematic diagram illustrating an exemplary system forgenerating schedule information according to some embodiments of thepresent disclosure. It should be noted that the functions of the systemare provided for illustration purposes, and are not intended to limitthe scope of the present disclosure.

As shown in FIG. 13, a responder terminal device (e.g., the responderterminal device 140) may receive an incoming call request 1301 from arequester terminal device (e.g., the requester terminal device 130). Insome embodiments, if the incoming call request 1301 includes an unknownor strange number (i.e., the incoming call number is unknown orstrange), or the incoming call request 1301 is not answered by theresponder terminal device (also referred to as missed call request), theresponder terminal device may determine that the incoming call request1301 satisfies the smart dialogue condition, and transmit the incomingcall request 1301 to a smart dialogue communication platform. In 1302,the smart dialogue communication platform may perform the smart dialoguecommunication with the requester terminal device. In some embodiments,the smart dialogue communication platform may communicate with therequester terminal device based on a knowledge database 1303 (e.g., aquestion knowledge database, an answer knowledge database, etc.). Insome embodiments, the smart dialogue communication platform may obtainvoice information generated in the smart dialogue communication. In1313, the smart dialogue communication platform may perform a textconversion on the voice information, i.e., the smart dialoguecommunication platform may convert the voice information into textinformation. For example, the smart dialogue communication platform mayconvert incoming voice messages into incoming text messages.

In some embodiments, the smart dialogue communication platform mayprocess the incoming text messages by performing word segmentationand/or annotation on the incoming text messages. In 1304, the smartdialogue communication platform may determine text features byperforming feature extraction on the processed incoming text messagesusing a language model 1305. The smart dialogue communication platformmay then determine event keywords 1306 by performing semantic analysison the text features using a semantic analysis model. The event keywords1306 may include the name the requester, the event time, the eventlocation, the event content, or the like.

If the incoming call number of the incoming call request exists in acontact list of the responder terminal device, the responder terminaldevice may determine a name in the contact list corresponding to theincoming call number as the name of the requester. In some embodiments,the smart dialogue communication platform or the responder terminaldevice may determine whether the event time is an absolute time. If theevent time 1307 is an absolute time, the smart dialogue communicationplatform or the responder terminal device may directly confirm the eventtime. If the event time is not an absolute time (e.g., a relative time),the smart dialogue communication platform or the responder terminaldevice may obtain the system time, and/or determine the event time basedon the relative time and the system time.

In some embodiments, the smart dialogue communication platform maygenerate the schedule information based on the event keywords and/ortransmit the schedule information to the responder terminal device. Insome embodiments, the smart dialogue communication platform may transmitthe event keywords to the responder terminal device, and the responderterminal device may generate the schedule information based on the eventkeywords. In some embodiments, the responder terminal device may displaythe schedule information on a display device 1314. The presentation ofthe schedule information may be shown in FIG. 13. In some embodiments,the responder terminal device may provide a user interface to facilitatethe user of the responder terminal device to edit the scheduleinformation based on the text information converted in 1313.

In some embodiments, the smart dialogue communication platform or theresponder terminal device may generate a reminder based on the scheduleinformation. For example, in 1308, the smart dialogue communicationplatform or the responder terminal device may perform an eventclassification on the schedule information according to an eventclassification rule 1309. According to the event classification rule1309, the event(s) may be classified base on a field of the event, animportance of the event, etc. In some embodiments, the smart dialoguecommunication platform or the responder terminal device may determine anemergency level (also referred to as emergency level matching operation1310) according to a level classification rule 1311. In someembodiments, in 1312, the smart dialogue communication platform or theresponder terminal device may set a reminder based on the emergencylevel. In some embodiments, the reminder may include sending out analarm immediately; sending out an alarm at one day before the eventtime, five hours before the event time, one hour before the event time,or the like.

FIG. 14 is a block diagram illustrating an exemplary device for smartdialogue communication according to some embodiments of the presentdisclosure. The device (also referred to as a schedule generationdevice) 1400 may be implemented on the computing device (e.g., theprocessor 220 of the computing device 200) as illustrated in FIG. 2,and/or the CPU 340 as illustrated in FIG. 3. As shown in FIG. 14, thedevice 1400 may include a smart dialogue communication module 1402, atext conversion module 1404, a semantic analysis module 1406, a schedulegeneration module 1408, and a schedule display module 1410. In someembodiments, the smart dialogue communication module 1402, the textconversion module 1404, the semantic analysis module 1406, the schedulegeneration module 1408, and the schedule display module 1410 may beimplemented on a smart dialogue communication platform.

The smart dialogue communication module 1402 may be configured toperform a smart dialogue communication with a requester terminal deviceassociated with a requester based on a dialogue request, and/or obtainincoming voice messages associated with the smart dialoguecommunication. The text conversion module 1404 may be configured toconvert the incoming voice messages into incoming text messages. Thesemantic analysis module 1406 may be configured to determine eventkeywords by performing semantic analysis on the incoming text messages.The schedule generation module 1408 may be configured to generateschedule information based on the event keywords. The schedule displaymodule 1410 may be configured to transmit the schedule information to aresponder terminal device associated with the incoming call request fordisplay.

The device 1400 in the present disclosure may be used to perform one ormore operations of process 1200 as described in FIG. 12. Theimplementation principle and technical effects of the device 1400 may besimilar to the operations illustrated in the process 1200. In someembodiments, the device 1400 may be an exemplary embodiment of theprocessing device 112.

FIG. 15 is a flowchart illustrating an exemplary process for determiningan emotion type of a requester according to some embodiments of thepresent disclosure. At least a portion of process 1500 may beimplemented on the computing device 200 as illustrated in FIG. 2 or themobile device 300 as illustrated in FIG. 3. In some embodiments, one ormore operations of the process 1500 may be implemented in the smartdialogue communication system 100 as illustrated in FIG. 1. In someembodiments, one or more operations in the process 1500 may be stored inthe storage device 150 and/or the storage (e.g., the ROM 230, the RAM240, etc.) as a form of instructions, and invoked and/or executed by theserver 110 (e.g., the processing device 112 in the server 110illustrated in FIG. 1, the processor 220 illustrated in FIG. 2, the CPU340 illustrated in FIG. 3, or one or more modules in the processingdevice 112 illustrated in FIG. 4). In some embodiments, the instructionsmay be transmitted in a form of electronic current or electricalsignals. The operations of the illustrated process presented below areintended to be illustrative. In some embodiments, the process 1500 maybe accomplished with one or more additional operations not described,and/or without one or more of the operations discussed. Additionally,the order in which the operations of the process 1500 as illustrated inFIG. 15 and described below is not intended to be limiting.

In 1501, the processing device 112 (e.g., the smart dialoguecommunication module 404) may perform a smart dialogue communicationwith a requester terminal device (e.g., the requester terminal device130) associated with a requester based on a dialogue request. Moredescriptions of the dialogue request and the performing of the smartdialogue communication may be found elsewhere in the present disclosure(e.g., FIGS. 5-6, and 8 and descriptions thereof).

In 1503, the processing device 112 (e.g., the smart dialoguecommunication module 404) may obtain incoming voice messages associatedwith the smart dialogue communication. The incoming voice messages mayinclude voice messages of the requester generated in the incoming callrequest and/or the smart dialogue communication. In some embodiments,the processing device 112 may obtain the incoming voice messages fromthe requester terminal device via a network (e.g., the network 120).

In 1505, the processing device 112 (e.g., the smart dialoguecommunication module 404) may convert the incoming voice messages intoincoming text messages. In some embodiments, the processing device 112may convert the incoming voice messages into the incoming text messagesbased on a speech recognition technique (also referred to as a speech totext technique). The speech recognition technique may use or include afeature parameter matching algorithm, a hidden Markov model (HMM)algorithm, an artificial neural network (ANN) algorithm, or the like, orany combination thereof. In some embodiments, the processing device 112may convert the incoming voice messages into the incoming text messagesusing a speech recognition model, e.g., a Gaussian mixture model andhidden Markov model (GMM-HMM), a PyTorch-Kaldi speech recognitiontoolkit, etc. The incoming voice messages may be input into the speechrecognition model, and the speech recognition model may generate and/oroutput the incoming text messages.

In 1507, the processing device 112 (e.g., the smart dialoguecommunication module 404) may determine sentiment features by performingsentiment analysis on the incoming text messages. In some embodiments,the sentiment features may include feature words representing emotions.In some embodiments, the sentiment features may be expressed as one ormore vectors. In some embodiments, the processing device 112 may performthe sentiment analysis using a sentiment analysis model. Exemplarysentiment analysis models may include an XLNet model, a bidirectionalencoder representation from transformers (BERT) model, or the like, orany combination thereof. The incoming text messages may be input intothe sentiment analysis model, and the sentiment analysis model maygenerate and/or output the sentiment features.

In 1509, the processing device 112 (e.g., the smart dialoguecommunication module 404) may determine mood features by performingfeature extraction on the incoming voice messages. In some embodiments,the mood features may include feature words representing tones. In someembodiments, the mood features may be expressed as one or more vectors.In some embodiments, the sentiment features may indicate emotionalfluctuations of the requester when the requester speaks. The moodfeatures may indicate tone changes of the requester when the requesterspeaks. The processing device 112 may perform the feature extractionusing a feature extraction model. Exemplary feature extraction modelsmay include a weighted finaite-state transducer (WFST) model. Theincoming voice messages may be input into the feature extraction model,and the feature extraction model may generate and/or output the moodfeatures.

In 1511, the processing device 112 (e.g., the smart dialoguecommunication module 404) may determine integrated features byintegrating the sentiment features and the mood features. The integratedfeatures may include feature words representing emotions and tones ofthe requester. In some embodiments, the processing device 112 maydirectly combine the sentiment features and the mood features togenerate the integrated features (e.g., an integrated vector).Alternatively, the processing device 112 may assign a first weight forthe sentiment features and a second weight for the mood features, anddetermine the integrated features based on the sentiment features andthe mood features and corresponding weights. The present disclosureconsiders both the sentiment features (corresponding to the incomingtext messages) and the mood features (corresponding to the incomingvoice messages), thereby ensuring the accuracy of emotion recognitionresults.

In 1513, the processing device 112 (e.g., the smart dialoguecommunication module 404) may determine an emotion type (of therequester) by matching the integrated features in an emotion database.The emotion type may include an emotion of the requester (e.g., happy,sad, calm, anxious, etc.), a character of the requester (e.g.,extroversion, introversion, etc.), or the like. The emotion database maystore a plurality of candidate emotion types. The emotion database maybe stored in a storage device associated with the smart dialoguecommunication platform (e.g., the storage device 150, the ROM 230, theRAM 240, the disk 270, the storage 390, etc.). Alternatively, theemotion database may be stored in an external storage device, and theprocessing device 112 may retrieve the emotion database via a network(e.g., the network 120). The emotion database may be preset by the smartdialogue communication system 100, or self-defined by a user of theresponder terminal device 140. In some embodiments, the processingdevice 112 may match the integrated features in the emotion databaseusing a third semantic matching model (e.g., a deep structured semanticmodel (DSSM), a long short term memory deep structured semantic model(LSTM-DSSM), an enhanced LSTM for natural language inference (ESIM), anattention-based convolutional neural network for modeling sentence pairs(ABCNN), etc.). For example, the processing device 112 may input theintegrated features and/or the candidate emotion types within theemotion database into the third semantic matching model. The thirdsemantic matching model may generate and/or output a matching degreebetween the integrated features and a (e.g., each) candidate emotiontype. In some embodiments, the processing device 112 may determine theemotion type based on the matching degree(s). For example, theprocessing device 112 may determine the candidate emotion type havingthe highest matching degree as the emotion type.

In some embodiments, if the emotion type is not obtained by matching theintegrated features in the emotion database, the processing device 112may create a new emotion type based on the integrated features, anddesignate the new emotion type as the emotion type. In some embodiments,the processing device 112 may update the emotion database by adding thenew emotion type to the emotion database. Thus, the processing device112 can quickly identify the emotion type of a requester when theprocessing device 112 receives the same or similar incoming call requestfrom the requester.

In 1515, the processing device 112 (e.g., the transmission module 408)may transmit the emotion type to a responder terminal device (e.g., theresponder terminal device 140) associated with the incoming callrequest. In some embodiments, the responder terminal device may displaythe emotion type on a display device of the responder terminal device.The display format may be preset by the smart dialogue communicationsystem 100 or defined by the user of the responder terminal device. Forexample, the display format may include “the incoming call number: ***;the emotion type: ***; “the character of the requester: ***”; theincoming call time: ***.” In some embodiments, if the incoming callnumber exists in the contact list, the display information may alsoinclude “the name of the requester: ***.”

In some embodiments, the processing device 112 may generate suggestioninformation based on the emotion type. The suggestion information mayindicate whether to respond to the incoming call request. In someembodiments, the processing device 112 may transmit the suggestioninformation to the responder terminal device to facilitate the user ofthe responder terminal device to respond to the incoming call request orthe requester terminal device according to the suggestion information.Merely by way of example, if the emotion type is determined as“anxiety”, the processing device 112 may generate suggestion information“call back immediately” and transmit the suggestion information to theresponder terminal device. In some embodiments, the responder terminaldevice may display the suggestion information on the display device ofthe responder terminal device. In some embodiments, the responderterminal device may broadcast the suggestion information via a voiceplayer of the responder terminal device. In some embodiments, theprocessing device 112 may adjust a tone pattern of the smart dialoguecommunication based on the emotion type, thereby improving userexperiences and user friendliness.

In some embodiments of the present disclosure, the processing device 112may perform the smart dialogue communication with the requester terminaldevice to obtain the incoming voice messages. The processing device 112may convert the incoming voice messages into the incoming text messages.The processing device 112 may determine sentiment features by performingsentiment analysis on the incoming text messages, and determine moodfeatures by performing feature extraction on the incoming voicemessages. The processing device 112 may determine the integratedfeatures by integrating the sentiment features and the mood features.The processing device 112 may determine an emotion type by matching theintegrated features in an emotion database and transmit the emotion typeto the responder terminal device. Thus, the user of the responderterminal device can know the emotions of the requester, and can respondto the incoming call request based on the emotions of the requester.Therefore, user experiences can be improved. In the present disclosure,the processing device 112 or the responder terminal device may obtain oridentify relatively more effective information during the smart dialoguecommunication (such as determining the emotion type of the requester),thereby making it convenient for the user of the responder terminaldevice to deal with incoming call affairs based on the obtainedinformation (such as the emotion type of the requester), and improvingthe user experiences.

It should be noted that the above description of the process 1500 isprovided for the purposes of illustration, and is not intended to limitthe scope of the present disclosure. For persons having ordinary skillsin the art, multiple variations and modifications may be made under theteachings of the present disclosure. However, those variations andmodifications do not depart from the scope of the present disclosure. Insome embodiments, operations 1507 and 1509 may be performedsimultaneously. In some embodiments, operation 1509 may be performedbefore or after operation 1507.

FIG. 16 is a schematic diagram illustrating an exemplary system fordetermining an emotion type of a requester according to some embodimentsof the present disclosure. It should be noted that the functions of thesystem are provided for illustration purposes, and are not intended tolimit the scope of the present disclosure.

As shown in FIG. 16, in 1601, a responder terminal device (e.g., theresponder terminal device 140) may receive an incoming call request froma requester terminal device (e.g., the requester terminal device 130).If the incoming call request satisfies the smart dialogue condition, theresponder terminal device may transmit the incoming call request to asmart dialogue communication platform, or the responder terminal devicemay generate a dialogue request based on the incoming call request andtransmit the dialogue request to the smart dialogue communicationplatform. In 1602, the smart dialogue communication platform may performthe smart dialogue communication with the requester terminal device. Insome embodiments, the smart dialogue communication platform maycommunicate with the requester terminal device based on a knowledgedatabase (e.g., a question knowledge database, an answer knowledgedatabase, etc.).

In 1603, the smart dialogue communication platform may obtain incomingvoice messages in the smart dialogue communication. In 1604, the smartdialogue communication platform may perform feature extraction on theincoming voice messages to determine mood features 1605 (e.g., a volumeof the voice, whether the tone is hasty, etc.). In 1606, the smartdialogue communication platform may perform text conversion on the voicemessages (e.g., convert the incoming voice messages into incoming textmessages). In 1607, the smart dialogue communication platform mayperform sentiment analysis on the incoming text messages to determinesentiment features 1608 (e.g., happy, neutral, sad, etc.). The smartdialogue communication platform may determine or generate integratedfeatures 1609 by integrating the sentiment features and the moodfeatures. In 1610, the smart dialogue communication platform may matchthe integrated features 1609 in an emotion database 1611, and determinewhether the integrated features are matched with an emotion type (alsoreferred as an emotion or character) in the emotion database. If theintegrated features are matched with a candidate emotion type, the smartdialogue communication platform may determine the candidate emotion typeas the emotion type 1613. If the integrated features are not matchedwith a candidate emotion type, in 1612, the smart dialogue communicationplatform may create a new emotion type and determine the new emotiontype as the emotion type 1613. In some embodiments, the smart dialoguecommunication platform may update the emotion database by adding the newemotion type to the emotion database. In some embodiments, the smartdialogue communication platform may transmit the emotion type to theresponder terminal device. The responder terminal device may display theemotion type on a display device of the responder terminal device.Merely by way of example, the display format may include “the incomingcall number: ***; the emotion type: ***; the incoming call time: ***.”As another example, the display format may include “the incoming callnumber: ***; the emotion of the requester: ***; the character of therequester: ***; the incoming call time: ***.” As a further example, thedisplay format may include “the incoming call number: ***; the name ofthe requester: ***; the emotion type: ***; the incoming call time: ***.”

FIG. 17 is a block diagram illustrating an exemplary device fordetermining an emotion type of a requester according to some embodimentsof the present disclosure. The device (also referred to as an emotionrecognition device) 1700 may be implemented on the computing device(e.g., the processor 220 of the computing device 200) as illustrated inFIG. 2, and/or the CPU 340 as illustrated in FIG. 3. As shown in FIG.17, the device 1700 may include a smart dialogue communication module1702, a first feature extraction module 1704, a second featureextraction module 1706, a feature integration module 1708, a semanticmatching module 1710, and an information transmission module 1712. Insome embodiments, the smart dialogue communication module 1702, thefirst feature extraction module 1704, the second feature extractionmodule 1706, the feature integration module 1708, the semantic matchingmodule 1710, and the information transmission module 1712 may beimplemented on a smart dialogue communication platform.

The smart dialogue communication module 1702 may be configured toperform a smart dialogue communication with a requester terminal deviceassociated with a requester based on a dialogue request, and/or obtainincoming voice messages associated with the smart dialoguecommunication. The first feature extraction module 1704 may beconfigured to convert the incoming voice messages into incoming textmessages, and/or determine sentiment features by performing sentimentanalysis on the incoming text messages. The second feature extractionmodule 1706 may be configured to determine mood features by performingfeature extraction on the incoming voice messages. The featureintegration module 1708 may be configured to determine integratedfeatures by integrating the sentiment features and the mood features.The semantic matching module 1710 may be configured to determine anemotion type by matching the integrated features in an emotion database.The information transmission module 1712 may be configured to transmitthe emotion type to a responder terminal device associated with theincoming call request.

The device 1700 in the present disclosure may be used to perform one ormore operations of process 1500 as described in FIG. 15. Theimplementation principle and technical effects of the device 1700 may besimilar to the operations illustrated in the process 1500. In someembodiments, the device 1700 may be an exemplary embodiment of theprocessing device 112.

FIG. 18A is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure. At least a portion of process 1800 may be implemented on thecomputing device 200 as illustrated in FIG. 2 or the mobile device 300as illustrated in FIG. 3. In some embodiments, one or more operations ofthe process 1800 may be implemented in the smart dialogue communicationsystem 100 as illustrated in FIG. 1. In some embodiments, one or moreoperations in the process 1800 may be stored in the storage device 150and/or the storage (e.g., the ROM 230, the RAM 240, etc.) as a form ofinstructions, and invoked and/or executed by the server 110 (e.g., theprocessing device 112 in the server 110 illustrated in FIG. 1, theprocessor 220 illustrated in FIG. 2, the CPU 340 illustrated in FIG. 3,or one or more modules in the processing device 112 illustrated in FIG.4). In some embodiments, the instructions may be transmitted in a formof electronic current or electrical signals. The operations of theillustrated process presented below are intended to be illustrative. Insome embodiments, the process 1800 may be accomplished with one or moreadditional operations not described, and/or without one or more of theoperations discussed. Additionally, the order in which the operations ofthe process 1800 as illustrated in FIG. 18A and described below is notintended to be limiting.

In 1801, a responder terminal device (e.g., the responder terminaldevice 140) may receive an incoming call request initiated by arequester via a requester terminal device (e.g., the requester terminaldevice 130). The incoming call request may be transmitted to therequester terminal device via a network (e.g., the network 120).

In 1803, the responder terminal device (e.g., the responder terminaldevice 140) may determine whether the incoming call request satisfies asmart dialogue condition. If the incoming call request satisfies thesmart dialogue condition, the responder terminal device may generate adialogue request configured to request a smart dialogue communicationbased on the incoming call request. The responder terminal device mayfurther transmit the dialogue request to a smart dialogue communicationplatform. In some embodiments, the smart dialogue condition may beassociated with whether an incoming call number associated with theincoming call request exists in a contact list of the responder terminaldevice; whether the incoming call number exists in a blacklist of theresponder terminal device; whether a requester time length of theincoming call request exceeds a time threshold (e.g., 1 minute, 2minutes). In some embodiments, the time threshold may be a default valueset by the smart dialogue communication system 100, or may be preset oradjusted by the user of the responder terminal device.

In some embodiments, the responder terminal device may determine anincoming call number based on the incoming call request, and/ordetermine whether the incoming call number exists in the contact list.In response to determining that the incoming call number exists in thecontact list, the responder terminal device may output the incoming callnumber. For example, the responder terminal device may display theincoming call number to remind the user to respond to the incoming callrequest. In some embodiments, the user of the responder terminal devicemay respond to the incoming call request (e.g., answering the incomingcall, refusing the incoming call). In some embodiments, beforeoutputting (or displaying) the incoming call number, the responderterminal device may determine whether the incoming call number exists ina blacklist of the responder terminal device. In response to determiningthat the incoming call number exists in the blacklist, the responderterminal device may refuse the incoming call request, and/or generate orrecord incoming call information based on the incoming call request. Theincoming call information may include information related to therequester and/or the incoming call. For example, the incoming callinformation may include: the incoming call number, the name of therequester, the incoming call time, the reason of being added into theblacklist, whether to be removed from the blacklist, or the like, or anycombination thereof. The responder terminal device may output theincoming call information (e.g., display the incoming call informationon a display device of the responder terminal device). In someembodiments, in response to determining that the incoming call numberdoes not exist in the blacklist, the responder terminal device maydirectly output the incoming call number to remind the user to respondto the incoming call request. In some embodiments, after outputting (ordisplaying) the incoming call number, the responder terminal device maydetermine a request time length of the incoming call requester (inreal-time). In response to determining that the request time lengthexceeds the time threshold, the responder terminal device may determinethat the incoming call request satisfies the smart dialogue condition.

In some embodiments, in response to determining that the incoming callnumber does not exist in the contact list, the responder terminal devicemay determine that the incoming call request satisfies the smartdialogue condition. In some embodiments, before determining that theincoming call request satisfies the smart dialogue condition, theresponder terminal device may determine whether the incoming call numberexists in the blacklist. In response to determining that the incomingcall number exists in the blacklist, the responder terminal device mayrefuse the incoming call request, and generate the incoming callinformation. The responder terminal device may output the incoming callinformation (e.g., display the incoming call information on a displaydevice of the responder terminal device). In some embodiments, inresponse to determining that the incoming call number does not exist inthe blacklist, the responder terminal device may determine that theincoming call request satisfies the smart dialogue condition.

In some embodiments, the responder terminal device may first determinewhether the incoming call request exists in the blacklist. If theincoming call request exists in the blacklist, the responder terminaldevice may refuse the incoming call request, generate the incoming callinformation based on the incoming call request, and/or output theincoming call information. If the incoming call request does not existin the blacklist, the responder terminal device may then determinewhether the incoming call request satisfies the smart dialoguecondition.

It should be noted that the above descriptions of the smart dialoguecondition may be merely provided for illustration purposes, and be notintended to limit the scope of the present disclosure. In someembodiments, the smart dialogue condition may be set by the responderterminal device or the user of the responder terminal device. Forexample, the responder terminal device may set one or more specifiedincoming call numbers as the smart dialogue condition. When an incomingcall request with a specified incoming call number is received by theresponder terminal device, the responder terminal device may determinethat the incoming call request satisfies the smart dialogue condition.

In 1805, the smart dialogue communication platform (e.g., the processingdevice 112) may perform a smart dialogue communication with therequester terminal device associated with the requester. Moredescriptions of the smart dialogue communication between the smartdialogue communication platform and the requester terminal device may befound elsewhere in the present disclosure (e.g., FIG. 6 and descriptionsthereof). In some embodiments, during the smart dialogue communication,the smart dialogue communication platform may continuously obtainincoming voice message(s) from the requester terminal device and/ortransmit reply voice message(s) to the requester terminal device.

In 1807, the smart dialogue communication platform (e.g., the processingdevice 112) may record voice information associated with the smartdialogue communication. In some embodiments, the voice information mayrefer to voice message(s) generated during the smart dialoguecommunication. For example, the voice information may include theincoming voice message(s) and/or the reply voice message(s). In someembodiments, the smart dialogue communication platform may store thevoice information in a storage device (e.g., the storage device 150, theROM 230, the RAM 240, the disk 270, or the storage 390, or an externalstorage device associated with the smart dialogue communication system100).

In 1809, the smart dialogue communication platform (e.g., the processingdevice 112) may convert the voice information into the text information.In some embodiments, the processing device 112 may convert the voiceinformation into text information based on a speech recognitiontechnique (also referred to as a speech to text technique). Moredescriptions of the speech recognition technique may be found elsewherein the present disclosure (e.g., process 603 of the process 600 and thedescriptions thereof). The voice information may be input into thespeech recognition model, and the speech recognition model may generateand/or output the text information.

In 1811, the smart dialogue communication platform (e.g., the processingdevice 112) may transmit the text information to the responder terminaldevice. The responder terminal device may receive the text informationand/or display the text information on a display device. The user of theresponder terminal device can timely check the text information.

In some embodiments of the present disclosure, the smart dialoguecommunication may be performed when the user is not convenient to answerthe incoming call request (e.g., the user is a deaf mute). In thepresent disclosure, when an incoming call request is received, theresponder terminal device may first determine whether the smart dialoguecondition is satisfied. If the smart dialogue condition is satisfied,the smart dialogue communication may be performed, thereby saving thetime of the user and improving the response efficiency. Besides, thevoice information generated during the smart dialogue communication maybe recorded, thereby effectively avoiding the omission of information(e.g., missing important incoming call request) and greatly improvingthe user experiences. In the present disclosure, a user of the responderterminal device may not need to preset call numbers for call transferand no additional manual operation is required, thereby providingrelatively high flexibility.

It should be noted that the above description regarding the process 1800is merely provided for the purposes of illustration, and not intended tolimit the scope of the present disclosure. For persons having ordinaryskills in the art, multiple variations and modifications may be madeunder the teachings of the present disclosure. However, those variationsand modifications do not depart from the scope of the presentdisclosure. For example, operation 1809 may be omitted. The smartdialogue communication platform may directly transmit the voiceinformation to the responder terminal device. The responder terminaldevice may receive the voice information and broadcast the voiceinformation via a voice player of the responder terminal device. Asanother example, the smart dialogue communication platform may recordtext information (e.g., a dialogue record including incoming textmessages, question information, and/or answer information) generatedbased on the voice information associated with the smart dialoguecommunication, and/or transmit the generated text information to theresponder terminal device. In some embodiments, operations 1807-1811 maybe repeated during the smart dialogue communication, and a user of theresponder terminal device can check the text information in real-time.

FIG. 18B is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure. At least a portion of process 1850 may be implemented on thecomputing device 200 as illustrated in FIG. 2 or the mobile device 300as illustrated in FIG. 3. In some embodiments, one or more operations ofthe process 1850 may be implemented in the smart dialogue communicationsystem 100 as illustrated in FIG. 1. In some embodiments, one or moreoperations in the process 1850 may be stored in the storage device 150and/or the storage (e.g., the ROM 230, the RAM 240, etc.) as a form ofinstructions, and invoked and/or executed by the server 110 (e.g., theprocessing device 112 in the server 110 illustrated in FIG. 1, theprocessor 220 illustrated in FIG. 2, the CPU 340 illustrated in FIG. 3,or one or more modules in the processing device 112 illustrated in FIG.4). In some embodiments, the instructions may be transmitted in a formof electronic current or electrical signals. The operations of theillustrated process presented below are intended to be illustrative. Insome embodiments, the process 1850 may be accomplished with one or moreadditional operations not described, and/or without one or more of theoperations discussed. Additionally, the order in which the operations ofthe process 1850 as illustrated in FIG. 18B and described below is notintended to be limiting.

In 1851, a smart dialogue communication platform (e.g., the processingdevice 112) may receive an incoming call request initiated by arequester via a requester terminal device. In some embodiments, theincoming call request may directly be transmitted to the smart dialoguecommunication other than a responder terminal device. In someembodiments, whether the incoming call request satisfies a smartdialogue condition may be determined by the smart dialogue communicationplatform. Alternatively, whether the incoming call request satisfies thesmart dialogue condition may be determined by the responder terminaldevice. For example, in 1853, the smart dialogue communication platform(e.g., the processing device 112) may transmit an incoming number of theincoming call request to a responder terminal device so that theresponder terminal device determines whether the incoming call requestsatisfies a smart dialogue condition based on the incoming call number.More descriptions of the smart dialogue condition may be found elsewherein the present disclosure (e.g., FIG. 18 and the descriptions thereof).In some embodiments, if the incoming call request satisfies the smartdialogue condition, the responder terminal device may generate adialogue request configured to request a smart dialogue communicationbased on the incoming call request. The responder terminal device maytransmit the dialogue request to the smart dialogue communicationplatform.

In 1855, the smart dialogue communication platform (e.g., the processingdevice 112) may perform a smart dialogue communication with therequester terminal device associated with the requester when theincoming call request satisfies the smart dialogue condition. Moredescriptions of the smart dialogue communication may be found elsewherein the present disclosure (e.g., FIG. 6 and the descriptions thereof).

In 1857, the smart dialogue communication platform (e.g., the processingdevice 112) may record voice information associated with the smartdialogue communication. In 1859, the smart dialogue communicationplatform (e.g., the processing device 112) may convert the voiceinformation into text information. Operations 1857 and 1859 may beperformed in a similar manner with operations 1807 and 1809, and thedescriptions are not repeated here.

It should be noted that the above description of the process 1850 isprovided for the purposes of illustration, and is not intended to limitthe scope of the present disclosure. For persons having ordinary skillsin the art, multiple variations and modifications may be made under theteachings of the present disclosure. However, those variations andmodifications do not depart from the scope of the present disclosure.For example, operation 1853 may be omitted. The smart dialoguecommunication platform (e.g., the processing device 112) may determinewhether the incoming call request satisfies the smart dialoguecondition, and perform the smart dialogue communication when theincoming call request satisfies the smart dialogue condition. As anotherexample, the process 1850 may include one or more additional operations(such as a transmission operation). For instance, in the transmissionoperation, the smart dialogue communication platform (e.g., theprocessing device 112) may transmit the text information to theresponder terminal device.

FIG. 19 is a schematic diagram illustrating an exemplary system forrecording text information associated with a smart dialoguecommunication according to some embodiments of the present disclosure.It should be noted that the functions of the system are provided forillustration purposes, and are not intended to limit the scope of thepresent disclosure.

As shown in FIG. 19, a responder terminal device (e.g., the responderterminal device 140) may receive an incoming call request 1901 from arequester terminal device (e.g., the requester terminal device 130). In1903, the responder terminal device may determine whether an incomingcall number of the incoming call request exists in a contact list of theresponder terminal device. If the incoming call number does not exist inthe contact list, the responder terminal device may determine theincoming call number as an unknown number 1911 (or strange number). Insome embodiments, the responder terminal device may determine whetherthe incoming call number exists in a blacklist of the responder terminaldevice. If the incoming call number exists in the blacklist 1913, theresponder terminal device may refuse the incoming call request, generateincoming call information based on the incoming call request, and/oroutput the incoming call information. The incoming call information mayinclude: the incoming call number, the name of the requester, theincoming call time, the reason of being added into the blacklist,whether to be removed from the blacklist, or the like, or anycombination thereof. In some embodiments, the incoming call informationmay be displayed on a display device of the responder terminal device.If the incoming call number does not exist in the blacklist 1913, in1917, the responder terminal device may determine that the incoming callrequest satisfies the smart dialogue condition.

Alternatively, if the incoming call number exists in the contact list1905, the responder terminal device may determine whether the incomingcall number exists in the blacklist 1913. If the incoming call numberexists in the blacklist 1913, the responder terminal device may refusethe incoming call request, generate incoming call information based onthe incoming call request, and/or output the incoming call information.In some embodiments, the incoming call information may be displayed on adisplay device of the responder terminal device. If the incoming callnumber exists in a white list 1907 of the responder terminal device, theresponder terminal device may output (e.g., display) the incoming callnumber. In 1909, a user of the responder terminal device may respond tothe incoming call request (e.g., answering the call, refusing the call,etc.). In some embodiments, in 1915, if the user does not respond to theincoming call request (e.g., the request time length exceeds a timethreshold), the responder terminal device may determine that theincoming call request satisfies the smart dialogue condition in 1917.

If the incoming call request satisfies the smart dialogue condition, theresponse terminal device may transmit the incoming call request to asmart dialogue communication platform, or the response terminal devicemay generate a dialogue request based on the incoming call request andtransmit the dialogue request to the smart dialogue communicationplatform. The smart dialogue communication platform may perform thesmart dialogue communication with the requester terminal device based ona knowledge database (e.g., a question knowledge database and/or ananswer knowledge database). More descriptions of the smart dialoguecommunication may be found elsewhere in the present disclosure (e.g.,FIG. 6 and the descriptions thereof). During the smart dialoguecommunication, text information 1919 may be generated and/or recorded bythe smart dialogue communication platform, and/or transmitted to theresponder terminal device. The responder terminal device may display thetext information 1919 on a display device. In some embodiments, theresponder terminal device may remind the user to check the textinformation. The user can check the text information in real-time orafter a time period.

FIG. 20A is a schematic diagram illustrating an exemplary device forrecording text information associated with a smart dialoguecommunication according to some embodiments of the present disclosure.The device (also referred to as a text information record device) 2000may be implemented on the computing device (e.g., the processor 220 ofthe computing device 200) as illustrated in FIG. 2, and/or the CPU 340as illustrated in FIG. 3. As shown in FIG. 20A, the device 2000 mayinclude a request receiving module 2002, a condition determinationmodule 2004, a smart dialogue communication module 2006, an informationcollection module 2008, a text conversion module 2010, and a texttransmission module 2012. In some embodiments, the request receivingmodule 2002 and/or the condition determination module 2004 may beimplemented on a responder terminal device (e.g., the responder terminaldevice 140). In some embodiments, the smart dialogue communicationmodule 2006, the information collection module 2008, the text conversionmodule 2010, and/or the text transmission module 2012 may be implementedon a smart dialogue communication platform.

The request receiving module 2002 may be configured to receive anincoming call request initiated by a requester via a requester terminaldevice (e.g., the requester terminal device 130). The conditiondetermination module 2004 may be configured to determine whether theincoming call request satisfies a smart dialogue condition. The smartdialogue communication module 2006 may be configured to perform a smartdialogue communication with the requester terminal device associatedwith the requester. The information collection module 2008 may beconfigured to record voice information associated with the smartdialogue communication. The text conversion module 2010 may beconfigured to convert the voice information into text information. Thetext transmission module 2012 may be configured to transmit the textinformation to the responder terminal device.

The device 2000 in the present disclosure may be used to perform one ormore operations of process 1800 as described in FIG. 18A. Theimplementation principle and technical effects of the device 2000 may besimilar to the operations illustrated in the process 1800. In someembodiments, the device 2000 may be an exemplary embodiment of theprocessing device 112.

FIG. 20B is a schematic diagram illustrating an exemplary device forrecording text information associated with a smart dialoguecommunication according to some embodiments of the present disclosure.The device (also referred to as a text information record device) 2500may be implemented on the computing device (e.g., the processor 220 ofthe computing device 200) as illustrated in FIG. 2, and/or the CPU 340as illustrated in FIG. 3. As shown in FIG. 20B, the device 2500 mayinclude a request receiving module 2502, an information transmissionmodule 2504, a smart dialogue communication module 2506, an informationcollection module 2508, and a text conversion module 2510. In someembodiments, the request receiving module 2502, the informationtransmission module 2504, the smart dialogue communication module 2506,the information collection module 2508, and the text conversion module2510 may be implemented on a smart dialogue communication platform.

The request receiving module 2502 may be configured to receive anincoming call request initiated by a requester via a requester terminaldevice (e.g., the requester terminal device 130). The informationtransmission module 2504 may be configured to transmit an incomingnumber of the incoming call request to a responder terminal device sothat the responder terminal device determines whether the incoming callrequest satisfies a smart dialogue condition based on the incoming callnumber. The smart dialogue communication module 2506 may be configuredto perform a smart dialogue communication with the requester terminaldevice associated with the requester when the incoming call requestsatisfies the smart dialogue condition. The information collectionmodule 2508 may be configured to record voice information associatedwith the smart dialogue communication. The text conversion module 2510may be configured to convert the voice information into textinformation.

The device 2500 in the present disclosure may be used to perform one ormore operations of process 1850 as described in FIG. 18B. Theimplementation principle and technical effects of the device 2500 may besimilar to the operations illustrated in the process 1850. In someembodiments, the device 2000 may be an exemplary embodiment of theprocessing device 112.

FIG. 21 is a flowchart illustrating an exemplary process for smartdialogue communication according to some embodiments of the presentdisclosure. At least a portion of process 2100 may be implemented on thecomputing device 200 as illustrated in FIG. 2 or the mobile device 300as illustrated in FIG. 3. In some embodiments, one or more operations ofthe process 2100 may be implemented in the smart dialogue communicationsystem 100 as illustrated in FIG. 1. In some embodiments, one or moreoperations in the process 2100 may be stored in the storage device 150and/or the storage (e.g., the ROM 230, the RAM 240, etc.) as a form ofinstructions, and invoked and/or executed by the server 110 (e.g., theprocessing device 112 in the server 110 illustrated in FIG. 1, theprocessor 220 illustrated in FIG. 2, the CPU 340 illustrated in FIG. 3,or one or more modules in the processing device 112 illustrated in FIG.4). In some embodiments, the instructions may be transmitted in a formof electronic current or electrical signals. The operations of theillustrated process presented below are intended to be illustrative. Insome embodiments, the process 2100 may be accomplished with one or moreadditional operations not described, and/or without one or more of theoperations discussed. Additionally, the order in which the operations ofthe process 2100 as illustrated in FIG. 21 and described below is notintended to be limiting.

In 2010, a responder terminal device (e.g., the responder terminaldevice 140) may receive an incoming call request initiated by arequester via a requester terminal device (e.g., the requester terminaldevice 130). The incoming call request may be transmitted to therequester terminal device via a network (e.g., the network 120).

In 2103, the responder terminal device (e.g., the responder terminaldevice 140) may determine whether the incoming call request satisfies asmart dialogue condition. If the incoming call request satisfies thesmart dialogue condition, the responder terminal device may generate adialogue request configured to request a smart dialogue communicationbased on the incoming call request. The responder terminal device mayfurther transmit the dialogue request to a smart dialogue communicationplatform.

In some embodiments, the responder terminal device may determine anincoming call number based on the incoming call request, and/ordetermine whether the incoming call number exists in a contact list ofthe responder terminal device. In response to determining that theincoming call number exists in the contact list, the responder terminaldevice may determine identity information of the requester based on apreset identity recorded in the contact list (e.g., a name of therequester). In some embodiments, the responder terminal device maydisplay the identity information of the requester via a display device(e.g., output the identity information for display). Alternatively, inresponse to determining that the incoming call number does not exist inthe contact list, the responder terminal device may determine that theincoming call request satisfies the smart dialogue condition.

In some embodiments, before determining whether the incoming callrequest satisfies the smart dialogue condition, the responder terminaldevice may determine whether the incoming call number is labelled. Inresponse to determining that the incoming call number is labelled, theresponder terminal device may obtain the identity information of therequester based on a labelled identity associated with the incoming callnumber (e.g., a profession of the requester such as a seller, a customerservice agent, etc.). The responder terminal device may display theidentity information of the requester via a display device (e.g., outputthe identity information for display). Alternatively, in response todetermining that the incoming call number is not labelled, the responderterminal device may determine whether the incoming call requestsatisfies the smart dialogue condition. More descriptions of thedetermination of whether the incoming call request satisfies the smartdialogue condition may be found elsewhere in the present disclosure(e.g., operation 1803 of process 1800 and the relevant descriptionsthereof).

In 2105, a smart dialogue communication platform (e.g., the processingdevice 112) may perform a smart dialogue communication with therequester terminal device associated with the requester. Moredescriptions of the smart dialogue communication between the smartdialogue communication platform and the requester terminal device may befound elsewhere in the present disclosure (e.g., FIG. 6 and descriptionsthereof). In some embodiments, during the smart dialogue communication,the smart dialogue communication platform may continuously obtainincoming voice message(s) from the requester terminal device and/ortransmit reply voice message(s) to the requester terminal device.

In 2107, if the identity information of the requester is not identifiedbefore the smart dialogue communication, the smart dialoguecommunication platform (e.g., the processing device 112) may obtain oneor more incoming voice messages associated with the smart dialoguecommunication from the requester terminal device. The incoming voicemessages may include voice messages of the requester. The smart dialoguecommunication platform may store the incoming voice messages in astorage device (e.g., the storage device 150, the ROM 230, the RAM 240,the disk 270, or the storage 390, or an external storage deviceassociated with the smart dialogue communication system 100).Alternatively or additionally, the smart dialogue communication platformmay store the incoming voice message(s) and the reply voice message(s)in the storage device.

In 2109, the smart dialogue communication platform (e.g., the processingdevice 112) may determine one or more feature words based on the one ormore incoming voice messages. In some embodiments, the smart dialoguecommunication platform may convert the incoming voice message(s) intoincoming text message(s). The smart dialogue communication platform maydetermine the feature word(s) by performing feature extraction on theincoming text message(s), e.g., using a language model. Thedetermination of the feature words may be similar to the determinationof the text features of the incoming text message(s) as illustrated inoperation 605 of process 600, and the descriptions are not repeated.More descriptions of the determination of the feature word(s) may befound elsewhere in the present disclosure (e.g., FIG. 6 and descriptionsthereof). In some embodiments, the feature words may include one or morewords that can indicate or imply the identity information of therequester, such as real estate sells, stocks, banking, game promotion,or the like, or any combination thereof.

In 2111, the smart dialogue communication platform (e.g., the processingdevice 112) may determine identity information of the requester bymatching the one or more feature words in an identity database. In someembodiments, the identity database may store a plurality of identitytypes. In some embodiments, the identity database may be stored in astorage device associated with the smart dialogue communication platform(e.g., the storage device 150, the ROM 230, the RAM 240, the disk 270,the storage 390, etc.). Alternatively, the identity database may bestored in an external storage device, and the processing device 112 mayretrieve the identity database via a network (e.g., the network 120).The identity database may be preset by the smart dialogue communicationsystem 100, or self-defined by a user of the responder terminal device140. In some embodiments, the smart dialogue communication platform maymatch the feature words in the identity database using a fourth semanticmatching model (e.g., a deep structured semantic model (DSSM), a longshort term memory deep structured semantic model (LSTM-DSSM), anenhanced LSTM for natural language inference (ESIM), an attention-basedconvolutional neural network for modeling sentence pairs (ABCNN), etc.).For example, the smart dialogue communication platform may input thefeature words and the identity types within the identity database intothe fourth semantic matching model. The fourth semantic matching modelmay generate and/or output a matching degree between the feature wordsand a (e.g., each) identity type. In some embodiments, the smartdialogue communication platform may determine the identity informationbased on the matching degree(s). For example, the smart dialoguecommunication platform may determine the identity type having thehighest matching degree as the identity information.

In some embodiments, the identity information cannot be obtained bymatching the one or more feature words in the identity database (e.g.,no identity type in the identity database matches with the featurewords). In such cases, the smart dialogue communication platform maycreate a new identity based on the feature words and determine theidentity information based on the new identity. In some embodiments, tocreate the new identity, the smart dialogue communication platform maydetermine an incoming call number based on the incoming call request;obtain a number structure of the incoming call number; determineattribute information of the incoming call number based on the numberstructure; and/or create the new identity based on the attributeinformation and the one or more feature words.

In some embodiments, the incoming call number may have a specific numberstructure. For example, a mobile incoming call number may apply a mobiledirectory number (MDN) structure with a E.164 encoding form. The MDNstructure may include three parts: CC+MAC+SN (H0H1H2H3ABCD), in which CCrefers to the country code (e.g., 86 representing China); MAC refers tothe mobile access code (e.g., 153/189 representing China Telecom); SNrefers to the subscriber number; H0H1H2H3 refers to the identifier ofhome location register (HLR) that is associated with location; and ABCDrefers to a user freedom number that is allocated by the HLR. Thus, theattribute information (e.g., location information) of the incoming callnumber may be determined based on the number structure. Further, the newidentity may be created based on the attribute information and/or thefeature words. In some embodiments, the smart dialogue communicationplatform may update the identity database by adding information relatingto the new identity into the identity database.

In 2113, the smart dialogue communication platform (e.g., the processingdevice 112) may transmit the identity information to the responderterminal device. In some embodiments, the responder terminal device maydisplay the identity information on a display device of the responderterminal device. In some embodiments, the identity information mayinclude: the incoming call number, the identity of the requester, theincoming call time, or the like, or any combination thereof. In someembodiments, the user can decide whether to intervene in the smartdialogue communication. For example, if the identity informationindicates that the incoming call request is a harassing call request,the user may send an instruction to the smart dialogue communicationplatform to end (or terminate) the smart dialogue communication. Asanother example, if the identity information indicates that the incomingcall request is an important call request, the user may directlyintervene in the communication with the requester via the responderterminal device. As a further example, if the identity informationindicates that the incoming call request is not a harassing call requestand an important call request, the user may not intervene in the smartdialogue communication.

In some embodiments of the present disclosure, the smart dialoguecommunication may be performed when the user is not convenient to answerthe incoming call request. In the present disclosure, when an incomingcall request is received, the responder terminal device may firstdetermine whether the smart dialogue condition is satisfied. If thesmart dialogue condition is satisfied, the smart dialogue communicationmay be performed, thereby saving the time of the user and improving theresponse efficiency. Besides, the identity information of the requesteris identified, and the user can decide whether to intervene in the smartdialogue communication, thereby reducing or filtrating harassing callswhile not missing important calls, and improving user experiences.

It should be noted that the above description regarding the process 2100is merely provided for the purposes of illustration, and not intended tolimit the scope of the present disclosure. For persons having ordinaryskills in the art, multiple variations and modifications may be madeunder the teachings of the present disclosure. However, those variationsand modifications do not depart from the scope of the presentdisclosure. In some embodiments, the identify information may beconverted into voice information, and the responder terminal device maybroadcast the identity information (e.g., voice information) via a voiceplayer of the responder terminal device.

FIG. 22 is a schematic diagram illustrating an exemplary system foridentifying identity information of a requester according to someembodiments of the present disclosure. It should be noted that thefunctions of the system are provided for illustration purposes, and arenot intended to limit the scope of the present disclosure.

As shown in FIG. 22, a responder terminal device (e.g., the responderterminal device 140) may receive an incoming call request 2201 from arequester terminal device (e.g., the requester terminal device 130). In2203, the responder terminal device may determine whether an incomingcall number of the incoming call request exists in a contact list of theresponder terminal device. If the incoming call number exist in thecontact list 2205, the responder terminal device may identify a presetidentity 2207 of the requester in the contact list, and/or determine theidentity information 2223 based on the preset identity 2207 in thecontact list. The responder terminal device may display the identityinformation 2223 via a display device (e.g., output the identityinformation for display). In some embodiments, the identity information2223 may include information relating to a profession (or occupation) ofthe requester, a field of work of the requester, a company that therequester belongs, a position (or title) of the requester, contactinformation of the requester, or the like. Alternatively, if theincoming call number does not exist in the contact list, the responderterminal device may determine the incoming call number as an unknownnumber 2211 (or strange number).

The responder terminal device may further determine whether the incomingcall number is labelled. If the incoming call number is a labelledincoming call number 2209 (e.g., including a labelled identity), theresponder terminal device may determine the identity information 2223based on the labelled incoming call number 2209 (e.g., the labelledidentity). The responder terminal device may display the identityinformation 2223 via the display device (e.g., output the identityinformation for display). Alternatively, if the incoming call number isnot labelled, in 2213, the responder terminal device may determine thatthe incoming call request satisfies the smart dialogue condition.

If the incoming call request satisfies the smart dialogue condition, theresponse terminal device may transmit the incoming call request to asmart dialogue communication platform. The smart dialogue communicationplatform may perform the smart dialogue communication with the requesterterminal device based on a knowledge database (e.g., a questionknowledge database and/or an answer knowledge database). Moredescriptions of the smart dialogue communication may be found elsewherein the present disclosure (e.g., FIG. 6 and the descriptions thereof).During the smart dialogue communication, in 2221, the smart dialoguecommunication platform may obtain one or more incoming voice messages,and/or extract one or more feature words based on the one or moreincoming voice messages. In 2219, the smart dialogue communicationplatform may match the one or more feature words in the identitydatabase 2215. If the feature words are matched, the smart dialoguecommunication platform may determine the identity information 2223,and/or transmit the identity information 2223 to the responder terminaldevice. The responder terminal device may display the identityinformation 2223 via the display device (e.g., output the identityinformation). Alternatively, if the feature words are not matched in theidentity database, in 2217, the smart dialogue communication platformmay create a new identity based on the one or more feature words and/ordetermine the identity information 2223 based on the new identity. Insome embodiments, the smart dialogue communication platform may furtherupdate the identity database by adding information relating to the newidentity into the identity database 2215.

FIG. 23 is a schematic diagram illustrating an exemplary device foridentifying identity information of a requester according to someembodiments of the present disclosure. The device (also referred to asan identity identification device) 2300 may be implemented on thecomputing device (e.g., the processor 220 of the computing device 200)as illustrated in FIG. 2, and/or the CPU 340 as illustrated in FIG. 3.As shown in FIG. 23, the device 2300 may include a request receivingmodule 2302, a condition determination module 2304, a smart dialoguecommunication module 2306, an information collection module 2308, afeature word obtaining module 2310, an information matching module 2312,an information transmission module 2314. In some embodiments, therequest receiving module 2002 and/or the condition determination module2004 may be implemented on a responder terminal device (e.g., theresponder terminal device 140). In some embodiments, the smart dialoguecommunication module 2306, the information collection module 2308, thefeature word obtaining module 2310, the information matching module2312, and/or the information transmission module 2314 may be implementedon a smart dialogue communication platform.

The request receiving module 2302 may be configured to receive anincoming call request initiated by a requester via a requester terminaldevice (e.g., the requester terminal device 130). The conditiondetermination module 2304 may be configured to determine whether theincoming call request satisfies a smart dialogue condition. The smartdialogue communication module 2306 may be configured to perform a smartdialogue communication with the requester terminal device associatedwith the requester. The information collection module 2308 may beconfigured to obtain one or more incoming voice messages associated withthe smart dialogue communication. The feature word obtaining module 2310may be configured to determine one or more feature words based on theone or more incoming voice messages. The information matching module2312 may be configured to determine the identity information by matchingthe one or more feature words in an identity database. The informationtransmission module 2314 may be configured to transmit the identityinformation to the responder terminal device.

The device 2300 in the present disclosure may be used to perform one ormore operations of process 2100 as described in FIG. 21. Theimplementation principle and technical effects of the device 2300 may besimilar to the operations illustrated in the process 2100. In someembodiments, the device 2300 may be an exemplary embodiment of theprocessing device 112.

FIG. 24 is a schematic diagram illustrating an exemplary deviceaccording to some embodiment of the present disclosure. In someembodiments, the device 2400 may include a storage device 2401 and aprocessor 2403. The storage device 2401 may be configured to storecomputer programs. The processor 2403 may be configured to execute thecomputer programs to implement one or more operations of the processes500-600, 800-900, 1200, 1500, 1800, and 2100 described above. The device2400 may further include a display device 2405. The display device 2405may be configured to display text information (such as the scheduleinformation, the emotion type, the identity information, the dialoguerecord, or the like).

The present disclosure also provides a computer-readable storage medium.The computer-readable storage medium may store computer programs. Thecomputer programs may be executed by a processor to perform theoperations of the processes 500-600, 800-900, 1200, 1500, 1800, and 2100described above. In some embodiments, the computer-readable storagemedium may include a U disk, a mobile hard disk, a read only memory(ROM), a random access memory (RAM), a magnetic disk, an optical disk,or any other medium that can store program codes.

Having thus described the basic concepts, it may be rather apparent tothose skilled in the art after reading this detailed disclosure that theforegoing detailed disclosure is intended to be presented by way ofexample only and is not limiting. Various alterations, improvements, andmodifications may occur and are intended to those skilled in the art,though not expressly stated herein. These alterations, improvements, andmodifications are intended to be suggested by this disclosure, and arewithin the spirit and scope of the exemplary embodiments of thisdisclosure.

Moreover, certain terminology has been used to describe embodiments ofthe present disclosure. For example, the terms “one embodiment,” “anembodiment,” and/or “some embodiments” mean that a particular feature,structure or characteristic described in connection with the embodimentis included in at least one embodiment of the present disclosure.Therefore, it is emphasized and should be appreciated that two or morereferences to “an embodiment” or “one embodiment” or “an alternativeembodiment” in various portions of this specification are notnecessarily all referring to the same embodiment. Furthermore, theparticular features, structures or characteristics may be combined assuitable in one or more embodiments of the present disclosure.

Further, it will be appreciated by one skilled in the art, aspects ofthe present disclosure may be illustrated and described herein in any ofa number of patentable classes or context including any new and usefulprocess, machine, manufacture, or composition of matter, or any new anduseful improvement thereof. Accordingly, aspects of the presentdisclosure may be implemented entirely hardware, entirely software(including firmware, resident software, micro-code, etc.) or combiningsoftware and hardware implementation that may all generally be referredto herein as a “unit,” “module,” or “system.” Furthermore, aspects ofthe present disclosure may take the form of a computer program productembodied in one or more computer readable media having computer readableprogram code embodied thereon.

A computer readable signal medium may include a propagated data signalwith computer readable program code embodied therein, for example, inbaseband or as part of a carrier wave. Such a propagated signal may takeany of a variety of forms, including electromagnetic, optical, or thelike, or any suitable combination thereof. A computer readable signalmedium may be any computer readable medium that is not a computerreadable storage medium and that may communicate, propagate, ortransport a program for use by or in connection with an instructionexecution system, apparatus, or device. Program code embodied on acomputer readable signal medium may be transmitted using any appropriatemedium, including wireless, wireline, optical fiber cable, RF, or thelike, or any suitable combination of the foregoing.

Computer program code for carrying out operations for aspects of thepresent disclosure may be written in any combination of one or moreprogramming languages, including an object oriented programming languagesuch as Java, Scala, Smalltalk, Eiffel, JADE, Emerald, C++, C#, VB. NET,Python or the like, conventional procedural programming languages, suchas the “C” programming language, Visual Basic, Fortran 2003, Perl, COBOL2002, PHP, ABAP, dynamic programming languages such as Python, Ruby andGroovy, or other programming languages. The program code may executeentirely on the user's computer, partly on the user's computer, as astand-alone software package, partly on the user's computer and partlyon a remote computer or entirely on the remote computer or server. Inthe latter scenario, the remote computer may be connected to the user'scomputer through any type of network, including a local area network(LAN) or a wide area network (WAN), or the connection may be made to anexternal computer (for example, through the Internet using an InternetService Provider) or in a cloud computing environment or offered as aservice such as a Software as a Service (SaaS).

Furthermore, the recited order of processing elements or sequences, orthe use of numbers, letters, or other designations therefore, is notintended to limit the claimed processes and methods to any order exceptas may be specified in the claims. Although the above disclosurediscusses through various examples what is currently considered to be avariety of useful embodiments of the disclosure, it is to be understoodthat such detail is solely for that purpose, and that the appendedclaims are not limited to the disclosed embodiments, but, on thecontrary, are intended to cover modifications and equivalentarrangements that are within the spirit and scope of the disclosedembodiments. For example, although the implementation of variouscomponents described above may be embodied in a hardware device, it mayalso be implemented as a software only solution, e.g., an installationon an existing server or mobile device.

Similarly, it should be appreciated that in the foregoing description ofembodiments of the present disclosure, various features are sometimesgrouped together in a single embodiment, figure, or description thereoffor the purpose of streamlining the disclosure aiding in theunderstanding of one or more of the various embodiments. This method ofdisclosure, however, is not to be interpreted as reflecting an intentionthat the claimed subject matter requires more features than areexpressly recited in each claim. Rather, claimed subject matter may liein less than all features of a single foregoing disclosed embodiment.

What is claimed is:
 1. A system for smart dialogue communication,comprising: at least one storage device including a set of instructions;and at least one processor in communication with the at least onestorage device, wherein when executing the set of instructions, the atleast one processor is configured to cause the system to performoperations including: receiving, from a responder terminal device, adialogue request configured to request a smart dialogue communication,wherein the dialogue request is associated with an incoming call requestthat is initiated by a requester via a requester terminal device andsatisfies a smart dialogue condition determined by the responderterminal device; performing the smart dialogue communication with therequester terminal device associated with the requester; recording voiceinformation associated with the smart dialogue communication; convertingthe voice information into the text information; and transmitting thetext information to the responder terminal device obtaining, from therequester terminal device, one or more first incoming voice messagesassociated with the smart dialogue communication; determining one ormore feature words based on the one or more first incoming voicemessages; and determining identity information by matching the one ormore feature words in an identity database.
 2. The system of claim 1,wherein the smart dialogue condition is associated with at least one of:whether an incoming call number associated with the incoming callrequest exists in a contact list of the responder terminal device;whether the incoming call number exists in a blacklist of the responderterminal device; or whether a request time length of the incoming callrequest exceeds a time threshold.
 3. The system of claim 1, wherein theat least one processor is further configured to cause the system toperform operations including: obtaining, from the requester terminaldevice, one or more first incoming voice messages associated with thesmart dialogue communication; determining one or more feature wordsbased on the one or more first incoming voice messages; and determiningidentity information by matching the one or more feature words in anidentity database recording voice information associated with the smartdialogue communication; converting the voice information into the textinformation; and transmitting the text information to the responderterminal device.
 4. The system of claim 1, wherein performing the smartdialogue communication includes: obtaining, from the requester terminaldevice, one or more second incoming voice messages; converting the oneor more second incoming voice messages into one or more incoming textmessages; determining text features by performing feature extraction onat least one of the one or more incoming text messages; determiningquestion information associated with the text features by matching thetext features in a question knowledge database; obtaining answerinformation corresponding to the question information by matching thequestion information in an answer knowledge database; converting theanswer information into at least one reply voice message; andtransmitting the at least one reply voice message to the requesterterminal device.
 5. The system of claim 3, wherein the at least oneprocessor is further configured to cause the system to performoperations including: determining sentiment features by performingsentiment analysis on the text information; determining mood features byperforming feature extraction on the text information; determiningintegrated features by integrating the sentiment features and the moodfeatures; determining an emotion type by matching the integratedfeatures in an emotion database; and transmitting the emotion type tothe responder terminal device associated with the incoming call request.6. The system of claim 5, wherein the at least one processor is furtherconfigured to cause the system to perform operations including:generating suggestion information based on the emotion type, thesuggestion information being associated with whether to respond to theincoming call request; and transmitting the suggestion information tothe responder terminal device.
 7. The system of claim 1, wherein the atleast one processor is further configured to cause the system to performoperations including: determining whether prompt information isreceived, the prompt information being associated with a replyinstruction provided by the responder terminal device; and in responseto determining that the prompt information is received, converting theprompt information into a reply voice message; and transmitting thereply voice message to the requester terminal device.
 8. The system ofclaim 7, wherein before performing the smart dialogue communication, theat least one processor is further configured to cause the system toperform operations including: determining whether an answer instructioncorresponding to the incoming call request is received from theresponder terminal device; and in response to determining that theanswer instruction is not received, performing the smart dialoguecommunication.
 9. The system of claim 3, wherein the at least oneprocessor is further configured to cause the system to performoperations including: determining event keywords by performing semanticanalysis on the text information; generating schedule information basedon the event keywords; and transmitting the schedule information to theresponder terminal device associated with the incoming call request fordisplay.
 10. The system of claim 9, wherein the determining the eventkeywords by performing semantic analysis on the text informationincludes: processing the text information by performing wordsegmentation and annotation on the text information; determining textfeatures by performing feature extraction on the processed textinformation using a language model; and determining the event keywordsby performing semantic analysis on the text features using a semanticanalysis model.
 11. The system of claim 10, wherein the determining theevent keywords includes: obtaining time information by performing thesemantic analysis on the text information; determining whether the timeinformation includes an absolute time; and in response to determiningthat the time information includes the absolute time, designating thetime information as an event time; or in response to determining thatthe time information includes no absolute time, determining the eventtime based on the time information and a system time.
 12. The system ofclaim 9, wherein the at least one processor is further configured tocause the system to perform operations including: determining anemergency level of the schedule information according to a levelclassification rule; determining a reminder time of the scheduleinformation based on the emergency level; and causing the responderterminal device to generate a reminder at the reminder time.
 13. Amethod for smart dialogue communication, comprising: receiving, from aresponder terminal device, a dialogue request configured to request asmart dialogue communication, wherein the dialogue request is associatedwith an incoming call request that is initiated by a requester via arequester terminal device and satisfies a smart dialogue conditiondetermined by the responder terminal device; performing the smartdialogue communication with the requester terminal device associatedwith the requester; obtaining, from the requester terminal device, oneor more first incoming voice messages associated with the smart dialoguecommunication; determining one or more feature words based on the one ormore first incoming voice messages; and determining identity informationby matching the one or more feature words in an identity database.recording voice information associated with the smart dialoguecommunication; converting the voice information into the textinformation; and transmitting the text information to the responderterminal device.
 14. The method of claim 13, wherein the smart dialoguecondition is associated with at least one of: whether an incoming callnumber associated with the incoming call request exists in a contactlist of the responder terminal device; whether the incoming call numberexists in a blacklist of the responder terminal device; or whether arequest time length of the incoming call request exceeds a timethreshold.
 15. The method of claim 13, further comprising: obtaining,from the requester terminal device, one or more first incoming voicemessages associated with the smart dialogue communication; determiningone or more feature words based on the one or more first incoming voicemessages; and determining identity information by matching the one ormore feature words in an identity database recording voice informationassociated with the smart dialogue communication; converting the voiceinformation into text information; and transmitting the text informationto the responder terminal device.
 16. The method of claim 13, whereinperforming the smart dialogue communication includes: obtaining, fromthe requester terminal device, one or more second incoming voicemessages; converting the one or more second incoming voice messages intoone or more incoming text messages; determining text features byperforming feature extraction on at least one of the one or moreincoming text messages; determining question information associated withthe text features by matching the text features in a question knowledgedatabase; obtaining answer information corresponding to the questioninformation by matching the question information in an answer knowledgedatabase; converting the answer information into at least one replyvoice message; and transmitting the at least one reply voice message tothe requester terminal device.
 17. The method of claim 15, furthercomprising: determining sentiment features by performing sentimentanalysis on the text information; determining mood features byperforming feature extraction on the text information; determiningintegrated features by integrating the sentiment features and the moodfeatures; determining an emotion type by matching the integratedfeatures in an emotion database; and transmitting the emotion type tothe responder terminal device associated with the incoming call request.18. The method of claim 17, further comprising: generating suggestioninformation based on the emotion type, the suggestion information beingassociated with whether to respond to the incoming call request; andtransmitting the suggestion information to the responder terminaldevice.
 19. The method of claim 13, further comprising: determiningwhether prompt information is received, the prompt information beingassociated with a reply instruction provided by the responder terminaldevice; and in response to determining that the prompt information isreceived, converting the prompt information into a reply voice message;and transmitting the reply voice message to the requester terminaldevice.
 20. A non-transitory readable medium, comprising at least oneset of instructions for smart dialogue communication, wherein whenexecuted by at least one processor of an electrical device, the at leastone set of instructions directs the at least one processor to perform amethod, the method comprising: receiving, from a responder terminaldevice, a dialogue request configured to request a smart dialoguecommunication, wherein the dialogue request is associated with anincoming call request that is initiated by a requester via a requesterterminal device and satisfies a smart dialogue condition determined bythe responder terminal device; performing the smart dialoguecommunication with the requester terminal device associated with therequester; obtaining, from the requester terminal device, one or morefirst incoming voice messages associated with the smart dialoguecommunication; determining one or more feature words based on the one ormore first incoming voice messages; and determining identity informationby matching the one or more feature words in an identity database.recording voice information associated with the smart dialoguecommunication; converting the voice information into the textinformation; and transmitting the text information to the responderterminal device.